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MASS SPECTRQMETRIC METHODS FOR SEQUENCING NUCLEIC ACIDS 
RELATED APPLICATIONS 

Benefit of prioritv Qlaimed to U«S. application Serial 
No.08/990,851« filed December 15, 1997, to Changwon Kang, Young- 
S SoQ Kworip Young Tad Kim, Hubert Koater, Daniel Little, Maryanne 
O'Donnell, Guobing Xiang, Cliarlefi Cantor and David Lough, entitled 
-MASS SPECTRQMETRIC METHODS FOR SEQU0y|ClNG NUCLEIC 
ACID5^ Where permitted, the subject maner of this application is 
Incorporated herein by reference in its entirety. 
1 0 The subject matter cf this application is related to the following 

copending applications, international applications and paterrts: U.S. 
Patent Nos, 5,605.798, 5.622,824 end 5,547,835. 5.691,141, 
International PCT application No. WO 98/20019, WO 98/20166 and 
WO 97/42348. Where permitted, the subject matter of each of these 
1 S applications and patents is incorporated herein In its entirety. 
BACKGROUND OF THE INVENTION 

Nucleic acid sequencing methods are important and powerful tools 
In the molecular bioiogist's repository of techniques for assessing and 
understanding gene expression and reguiation. Methods for sequencing 
20 DNA molecules Indude chemical degradation sequencing (Maxam et bL 
(1 977) Proc. Natl. Acad. Sci. USA 74 ; 560, see, also, Ambrose gt aL 
(19871 Methods Enz. 152 : 522) and chain termination sequencing 
(Sanger et sL (1 977) Proc. NatL Acad. Sci, USA 74: 5463). The 
Maxam-Gilbert sequencing method is a degradative method that reltes on 
25 specific cleavage of DNA fragments. A fragment of DNA, which Is 
radiolabeled at one end, is partially cleaved in five separate chemical 
reactions. Each reaction is specific for a type of base or a base so that 
five populations of labeled fragments of differing lengths are generated. 
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The populations of fragments are resolved by polyacrylamide gel 
electrophoresis <PAGE). 

The chain termination or Sanger method relies on a DNA synthesis. 
Single-stranded DNA is used as a template end labeled primers are used 
5 to initiate formation of complementary strands. The synthesis reaction is 
mn In the presence of the four deoxynucfeotides (dNTPs), dATP, dTTP, 
dCTP and dGTP and one of die four dldeoxynLicleotides (ddNTPsK 
ddATP, ddTTP, ddCTP, and ddGTP. The complementary strands are 
prematurely cermtnated along the chain by addition of dideoxynucleotides 

10 to the growing chains. By starting with four reaction mixtures, whbh 
each contain one of die four ddNTPSr sets of strands terminating in A, 
G and T are generated. Each reaction mixture is electrophoresed in one 
lane of a poiyacrylamide gel that resoh^es fragments that differ by a 
single base in length. 

1 5 These methods are primarily designed tor sequencing single- 

stranded DNA, which is produced by denaturing the DNA and separating 
the single strands, or by other methods, such as cioning into single- 
stranded phage vectors. These sequencing methods rely on reactions 
that produce an array of fragments that differ in length by a single base 

20 and temninate In an identrfiable base. The fragments are resolved by size 
using PAGE and are detected using a label, such as a radioisotope. 
Because the resolution of bands on an electrophoretic gel decreases 
exponentially as the length of the DNA fragments increase, these 
methods only permh: DNA fragments of up to about 300 to 400 

25 nucleotides to be sequenced. 

The target DNA can also be amplified using the polymerase chain 
reaction (PGR) <see, e^, Mullis et al (1986) Cold Spring Harbor Svmp. 
Quant. Plol, 51: 263; U.S. Patent No. 4.683,202 to Mulfis ^ aL). which 
results in an amplified concentration of the dupl x target DNA. A 
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number of methods have been used to generate single-stranded 
templates directly f ronn PGR for subsequent sequencing. For example, 
radio^labeled primer that is specific for only one strand may be used. 
Alternatively, PGR may be run under conditions such that one primer is at 
5 limited concentration. Once the primer that is at the Hmited 

concentration is exhausted , the second strand Is ampfified at a linear rate 
through succeeding cycles (Gylienstein et aL (1988) Proc, Nati. Acad. 
ScL 85: 7852; Mihilovic gt aL (1989) BioTechnlaues 7^14^ 

There are numerous sequencing strategies in use. The &elected 

10 strategy depends upon the purpose for which the DNA Is sequenced and 
the amount of information avaiieble about the DNA prior to sequencing. 
For example. If the target DNA is sequenced in order to confrrm that a 
particular mutation has been introduced fnto the DNA, it may only be 
necessary to sequence a small region of DNA. If the DNA fragment is an 

15 unknown gene or portron of a gene for which a sequence must be 

accurately determined, then rt may be necessary to sequence the entire 
fragment. Because the sizes of fragments that can be sequenced are 
limited to about 400 bases, DNA fragments longer than this size must be 
cleaved. Cleavage may be random, by euboloning segments of the target 

20 DNA. The subcloned fragments, which include overlapprng fragments, 
are then sequenced, and ordered using a computer program (see, e.q„ 
Staden (1986) Nuoi. Acids Ras. 14^21 7K Alternatively, the DNA may be 
systematically subcloned by generating and sequencing overlapping or 
nested mutants or by other ordered approaches. 

25 The Sanger chain termination method and other sequencing 

methods rely on the use of singie-stranded template by cloning the target 
DNA Into single-stranded phage vectors. The use of plasmfds as vectors 
for the target DNA, however, is preferred over the use of phage DNA for 
reasons, which include the variety of avatiabJe piasmids, the ease with 
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whlch plasmids are manipulated, and the greater stability of inserted DNA 
In plasmids compared to phage vector&. Consequently, methods for 
sequencing in which the target DNA is cloned into piaamfd DNA, rather 
than into a single-stranded phage DNA <see, e.g., Waltance st gL (1981) 
5 Geiig 16: 21-26; Guo fiS fiL « 1 982) NucI, Acids Res. 10: 2065-2084; 
Vieira et {T98Z) Gene 19: 259-268) have been developed. 

These methods are designed to only sequence one strand of the 
target DNA at a time (see, e^, Chen et ah (1985) DNA 4: 16B-170; 
Hattori et aL (1 986) Anal> Biochsm. 152 : 232-238; Mierendorf at aL 

10 {1987} l^ethods Enzvmol. 1B2 : 556; Mehra gt aL (1986) Proo. WatJ. 
Acad. Sci. USA 83: 7013-7017). Use of double-stranded DNA, 
however, avoids the subcioning or isolation of single-stranded DiMA 
fragments, which are used for the didaoxy chain terminator sequencing 
reactions. The use of double-stranded DNA, however, had been limited 

15 because of the poor template quality of denatured duplex DNA. As a 
result, these methods had not provided as accurate sequence data as 
provided by methods In which the DNA is cloned into a single-stranded 
vector. 

Recently, the problems associated with template quality have been 
20 solved by the development of methods that use plasmids that mclude 
sites, adjacent to the complementary strands of the inserted DNA, to 
.which strand-specific primers may be efficiently hybrtdtzed. By virtue of 
these methods each of the single strands of double-stranded DNA can be 
sequenced directly from plasmid DNA without prior subcioning into 
25 phage vectors (see, a^ Chen et aL ((1985} DNA 4: 165-170 and Chi et 
n988) Nucl. Acids flee. 16: 10382). A strand specific synthetic 
primer is annealed to covaiently closed circular DNA, whrch has been 
denatured by heat or alkali, before proceeding with dideoxy sequencing 
reactions. Alternatively, the primer can be annealed to open circle 
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double-stranded plasmid DNA, which has been denatured by alkali, as a 
template (ass, Hattori et aL |{1986) Anal, Blochem. 152 : 232-238). The 
double-stranded DNA is denatured with elkafi or heat prior to sequencing 
U3ing the Sanger method, which fs performed at 37* C or higher. The 
5 use of different "forward" and "reverse" primers, which are each 
complementary to the lac Z sequences adjacent to the Eeo RI site in 
iigtl 1, for separately sequencing each strand of DNA that has been 
cloned into ^gtl 1 has also been described <see, Mehra et aL (1986) Proc, 
Natl. Acad, ScL USA &2i 7013-7017). 

10 Plasmids with oppositely oriented promoter regions, which are 

used in methods which involve transcription, are also used as vehicles 
for target DNA which is to be sequenced. Each promoter region serves 
as a distinct specific priming site for sequencing the inserted DNA. Such 
plasmids are commercially available. For example, the twin promoter 

15 plasmid pGEM™ contains the bacteriophage SP6 and T7 RNA polymerase 
promoters in opposite orientations (IVIierendorf §iaL (1987} Meth. 
gp^irw.la 152; 556-562} which can generate transcripts for sequencing. 

IWosi methods for sequencing nucleic acids require the use of 
polyacryJamide get electrophoresis (i.e., PAGE) that can result in 

20 sequencing artifacts or require detectable labels, such as radioisotopes, 
enzymes, or fluorescent or chemiluminescent moieties. Methods for 
sequencing of DNA using mass spectronneric detection are also In use 
(see. U.S. Patent No. 5,547,835}. These methods primarily rely on 
sequencing of DMA. 

25 Using DNA sequencing methodologies, the entire sequence af the 

human genome ultimately will be determined. The knowledge of the 
complete sequence off the human genome DNA will help to understand, 
to diagnose, to prevent and to treat human diseases. T b able to 
tackle successfuNy the determination of the approximately 3 billion base 
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pairs of the human genome tn a reasonable time frame and in an 
economical way, rapid, reliable, sensitive and inexpensive methods are 
required. 

ITierefore it is an object herein to provide additional methods for 
5 sequencing. In particular, it is an object herein to provide mass 
spaotrometric methods of sequencing nudeic acids using RNA 
polymerase, it is a further object herein to pro^^de methods of 
sequencing nudeio acids In an array format using RNA polymerase In 
which nucleic acid probes are immobilized to supports at high densities 

10 to facilitate mass spectrametric detection. It is also an object herein to 
provide methods for identifying transcriptional terminator sequences 
using mass spectrometric methods. 
SUMMARY OF THE rNVENnON 

Methods for sequencing nucleic acids, partiouarly DNA, using RNA 

15 transcripts are provided. In particular, methods for sequencing nucleic 
acids using RNA polymerases, including DNA-dependent and RNA- 
dependent RNA polymerasas. are provided. RNA transcripts are analyzed 
by mass spectrometry. 

In practicing the methods, a DNA molecule comprising a promoter 

20 and target nucleic acid sequence operatively linked to the promoter is 
transcribed with an RNA polymerase that recognizes the promoter under 
coi^Itions that generate a nested set of RNA transcripts that include the 
target is produced. The molecular weight of the transcripts are 
determined by mass spectrometry to thereby determine the nucleic add 

25 sequence of the target sequence. The DNA molecule that contains the 
nucteio acid and target is preferably immobiiized. 

In certabi embodiments, the methods use a double-stranded 
promoter-containing fragment with a single-stranded region at one and. 
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whlch captures a molecule that contafns Ihe target DNA. In these 
embodKments, the target DNA is preferably single-stranded. 

In preferred embodimentd, the promoter-containing nuclarc acid is 
covalently coupled via the S'-end of the noncoding (antiaenee) strand or 
5 5'^end of the coding strand to a solid support. Coupling can be by any 
method known to those of skill in the art, such as via a thiol linkage. For 
example, 5'- or 3'-thiolated DMA is prepared and linked at to a 
aminosilane-treated solid support. The linkage may be direct or via a 
linker group. The resulting immobilized molecules are preferably arranged 
10 in an array format 

A douWe-stranded nucleic acid molecule encoding a promoter 
sequence can obtained from any source, such as bacterra, vrruses, 
bacteriophages, plants and eufcaryotic organisms, or assembled by 
synthetic protocols. It Is engineered using so that the resulting fragment 
15 contains a single-stranded region of at least a 5 nucleotides at the 3'-end 
of the codfng (sense) atrand. This single-stranded region Is designed 
such that it is complementary to a region of the nucleic acid to be 
sequenced or to a common overlapping sequencer such as a restriction 
endonuclease site. 

20 The target nucleic acid, which fnlcudes the region of nucleic acid 

to be sequenced is singla-stranded or double-stranded but includes a 
single-stranded 3'-end, rs hybridized to the complementary sequences of 
the promoter-containing DNA< Hybridization Is performed either before or 
after immobilization. The hybridization of the two nucleic acid 

25 molecules intrcduces one or more "nicks'' In the hybrid at the junctron{s) 
of the adjacent nuclerc acid molecules. In certain embodiments, nJcks In 
the coding or non-coding strand, preferably the coding strand, ara ligated 
by the addition of an appropriate nucfoic acid ligase prior to mitiaiing 
transcription. 
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The methods preferably employ any suitable strategy for 
generating nested RNA transcripts. The preferred methods rely on a 
modified Sanger sequencing strategy in which RNA polymerase Is used 
to generate a set of nested RNA transcripts obtained by base-specific 
5 chain termination. The&e are analyzed by mass spectrometry » A 
modified Gilbert-Maxam strategy is also cantempfated« in this method, 
the transcripts are cleaved with base^specif ic RNa&es to produce base- 
specifically termrnated fragments and the molecular weights are then 
assess by mass spectrometry. 

10 Transcription is initiated from the promoter by the addition of the 

appropriate RNA polymerase in the presence of ribonucleoside 
triphosphates and a selected base-specific chain terminating 3'- 
deoxyribonucieoslde triphosphate. In preferred embodiments, the 
transcription mixture also contains modified ribonucleoside triphosphates 

1 5 and ribonucleotide analogs ribonucleotide triphosphate analogs that 
reduce secondary structure of the resulting RNA transcrfpts and/or 
increase the fidelity of termination and turnover of the RNA polymerase 
enzyme thereby mcreaslnQ the amount of RNA transcr^t available for 
analysis.. These analogs include, but are not limited to: Inosine 5'- 

20 triphosphate (FTP), which reduces secondary structure, 4-thio uridine 
5'triphosphate (UTP) and 5-bromo UTP or 5''jodo CTP, which increase 
the fidelity of termination and turnover of the RNA polymerase enzyme 
thereby Increasing the amount of RNA transcript available for analysis. 
The resulting RNA transcripts are analyzed by mass spectrometry. 

25 in other embodiments, the sample further contains a matrix 

material for mass spectrometry for analysis by matrix-assisted laser 
desorption/lonization mass spectrometry (MALDI) and preferably further 
uses time-of-f light (TOF) analysis. The sequence of the nucleic acid is 
obtained by aligning the observed mass of the chain-terminated RNA 
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transcripts obtained from sequencing reactions comaining each of the 
four chain-terminating bases. 

The methods of sequencing provided herein may be used for 
diagnostic applications. Diagnostic applications, include methods for 
5 deternfTination of the presence of genetic alterations in a known target 
nucleic acid. For example, a region of the target nucleic acid is amplified 
and the nucleic acid strand corresponding to the noncodir\g strand is 
isolated. The nucleic acid probe containing the promoter may be tsolatad 
from a natural source or assembled synthetically by hybridising two 
10 complementary oligonucleotides to form a promoter sequence. A sin- 
gte-stranded region of at least 5 nucleotides that is complementary to a 
region of the nudeic acid to be sequenced or to a common sequence is 
fntroduced by recombinant means at the 3'-end of the coding strand. In 
prefen-ed embodiments, the promoter-containing nucFeic acid is 
IS cQvalently coupled via the 3'-end of the noncoding strand or 5'-end of 
the coding strand to a solid support and, more preferably. Is a 5'- or 3'- 
thiolated DMA linked at high densities to a amlnoailane-treated solid 
support. The linkage may be in the absence or presence of a linker group 
and is preferably arranged in an array format. 
20 A single-stranded 3' overhang of the nucleic aerd to be sequenced, 

in single-stranded or double-stranded form, Is hybridized to the 
complementary sequences of the noncoding strand and, in some 
embodiments, tlie niokls) between one or more nucleic acid strands 
is/are ligated prior to transcription. Transcription is initiated using the 
25 appropriate RNA polymerase in the presence of ribonucJeoside 
triphosphates and s selected base-specific chain terminating 3'- 
daoxyribonucfeoside triphosphate, in preferred embodnnents, the 
transcription mixture may also contain inosine 5' -triphosphate ar*d/or one 
or mor modified ribonucleoside triphosphates to fBcllltats analysis of the 
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RNA transcripts. The RNA transcripts are analyzed by mass 
spectrometry, preferably using MALDMOF. 

When used in array formats, a panel of promoter-containing 
nucleic acid probes can ba constructed and linked in an array such that 
5 the target nucleic acid is permuted along the entire sequence, e.g.. the 
coding sequence of a gene, allowing for the determination of the nucleic 
acid sequence of the entire gene during a sfngle reaction sequence* 

Methods of identifying transcriptional terminator and attenuator 
sequences are also provrded. By modifying transcription conditions to 

10 produce full-length transcripts, transcriptional terminator sequences, such 
as rho-dependent and rho-independent terminators, may be identified 
using mass spectrometric methods. In practicing the methods, a single- 
stranded region of the 3'-end of the target nucleic acid to be sequenced 
is hybridized to a complementary sequence at the 3'-end of the coding 

15 strand a promoter-containing nucleic add probe* In preferred 

embodiments, the promoter-contarning nucleic acid is co\faiently coupled 
via the B'-end of the noncodlng strand or 3' -end of the coding strand to a 
solid support and, more preferably, is e 5'- or 3'-thiolated DNA liniced at 
high densities to a aminosilane^treated sotid support. The linkage may be 

20 in the absence or presence of a linker group and is preferably arranged in 
an array format. 

Transcrfption fs initiated in the absence or presence of modified 
RNA triphosphate analogs that increase the effidency of RNA polymerase 
termination at such terminator sequences, such as 4-thio LPTPr 5-bromo 
25 UTP or 5'-iodo CTP. In certain embodiments, nicks in one or more strand 
may be ligated by the addition of an appropriate nucleic acid Hgase prior 
to inWating transcription (i^ adding a DNA or RNA liga$e)« The mass of 
the terminated RNA transcripts is determined by mass spectrometry. The 
observed mass is indicetive of the location of the termlnator-d pendent 
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airest of transcriptron and by comparing the aJlgnrriflnt of the sequence 
immediately preceding the site of tran&crfptional termination from distinct 
genomic locations, terminator end attenuator sequences may be 
identified for different RNA polymerases. 
5 DESCRIPTION OF THE DRAWtNGS 

Figure 1 illustrates a system for preparing arrays of a sample 
materiai for analysis. 

Figure 2 illustrates a pin assembly suitable for use with the system 
depicted in Fig. 1 for implementing a paralEel process of dispensing 
10 material to a surface of a substraie. 

Figure 3 depicts a bottom portion of the assembly shown in Fig. 2. 

Figure 4 depicts an alternative view of the bottom portion of the 
pin assembly depicted in Fig. 2. 

Figures 5A-5D depict a method for preparing an an^ay of sample 
1 B material. 

Figures 6A-6B depict an altarnatlve assembly for dispensing 
material to the surface of a substrate. 

Figure 7 Is a schematic showing covalent attachment of 
oiigodeoxynucJeotides to a silicon dioxide surface as described herein- In 

20 particular, silicon dioxide was reacted with 3-aminopropyitriethoxysHane 
to prodxice a uniform layer of primary amrno groups on the surface. A 
heterobifunctional crossiinldng agent was then reacted with the primary 
amine to Incorporate an iodoacetamide-group. An oiigodeoxynucleotida 
containing a 3'- or S'-drsulfide (shown as the 5') was treated with tris-(2- 

25 carboxy ethyl) phosphine (TCEP) to reduce the disulfide to a free thiol, 
which was then coupled to the lodoacetamido-surface. 
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Figure 8 is a graph which plots conjugation of ofigodeoxynucleo- 
tide probes to a siiicon surface as a function of TCEP concentration used 
in the disulfide reduction. 

Rgure 9 is a matrix assisted laser dasorbtion/lonization-time-of- 
5 flight <MALDI-TOF) mass spectrum of a silicon wafer with the 

ofigodeoxynucieotlda sequence denoted "TCUC {5'-GAATTCGAGCTCG 
GTACCCG6-3'; SEQ ID NO: ^\ covalently bound essentially as described 
in Figure 7 and the ollgodeoxynucleotide sequence denoted "MJM6" (5'- 
CCGGGTACCGAGCTCGAATTC-3'? SEQ ID NO: 2) hybridized thereto. 
10 Figure 1 0 depicts one embodiment of a substrate having welis 

etched therein that are suitable for receiving nwtarial for analysis. 

Figure 1 1 depicts one example of spectra obtained from a linear 
time of flight mass speotrometer instrument and representative of the 
material composition of the sample material on the surface of the 
15 substrate depicted in Fig. 10. 

Figure T2 depicts mclecuiar weights determined for the sample 
material having spectra identified in Fig. 11. 

Rgure 1 3 is a schematie of a 4 x 4 (le iocation) DMA array on the 
surface of a silicon wafer with the thiol-containing oUgonucieotide 
20 molecules denoted "Oiigomer 1 (5'-CTG(3ATQCQTCGGATCATCTTnTT-(S).3'; 
SEQ ID NO; 5), Oiigomer 2 |5'-<SI-CCTCTTGQQAactgtgtaqtatt-3'; SEQ 
ID NO: 6) and "Oligomer 3" (SEQ ID NOi 1; a free thiol derivative 
"TCUC" oligonucleotide of EXAI^PLE 1) covalently bound to 1 6 locations 
on the surface of the silicon wafer essentially as described in 
25 EXAI\1PLE 2. 

Figure 14 Is a schematic of the hybridization of specific 
oligonucleotides to each of the 16 locations of the DISIA hybridization 
array of Rgure 13 with the Oligomer 1 complementary ollgonuoleotide 
(5'-GATQATCCGACGCATCAQAATGT-3'; SEQ ID NO: 7) bound to 
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Ofigomer 1 , the Oligomer 2 complemdntary olrgonuoleatide (5'- 
AATACTACACAG-3'; SEQ ID NO: 8) bound to Oligomer 2 and the 
Oligomer 3 complementary oligonucleotide (5'- 

CCGGGTACCQAQCTCGAATTC*3'; SEQ ID NO: 2) bound to Oligomer 3, 
5 Figure 15 is a representative MALDI-TOF mass spectrum of a 4 x 4 

n6-locatlon} DNA array on a sMfcon wafer shown scheniaticelly in Figure 
15. The spectrum reveals a single, predominant signal of an 
experimental mass^to-charge ratio in each location corresponding lo the 
specific hybridized oligonucleotides. The 2+ indicates the position of a 

10 doubly charged molecule used as a reference standard during MALDI-TOF 
MS analysis. The * denotes residual amounts of contaminating 
oligonucleotide that remain on the surface of the chip following washing 
procedures. The relative position of the • signal reveals the approximate 
size of the contammeting oligonucleotide* 

15 Figure 1 6 is a representative MALDI-TOF mass spectrum of an 8 x 

8 f64-location) DNA an"ay. The spectrum reveals a single, predominant 
signal of an experimental mass-to-c;harge ratio corresponding to the 
predicted specific hybridized ofigonucleotldes. The * denotes residual 
amounts of contaminating oligonucleotide that remain on the surface of 

20 the wafer following washing procedures. The relative position of the • 
signal reveals the approximate size of the contaminating oligonucleotide. 

Figure 17 shows the nucleotide sequence of a DNA molecule (SEQ 
ID No: 101 assembled by hybridizing a 55-mer oligonucleotide to a 
complementary 25-mer oligonuoleotrde (SEQ ID No: 11) and a 

25 complementary 30-mer oligonucleotide (SEQ ID No: 12). The resulting 
double-stranded DNA encodes a SP6 promoter (nt 1-18 of SEQ ID No: 
10) and has a single nick In the coding strand of the molecule at nt +7 
relative to the start of transcription from the SP6 promoter. The position 
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of the nick and the start of OranBcription initiation from the SP6 promoter 
are indicated. 

DETAILED DESCRrPTIQN AND PREFERRED EMBODIMENTS 
Definitions 

5 Unless defined otherwise, all technical and scientif rc tarms used 

heretn have the same meaning as is commonly understood by one of skill 
in the art to which this invention belongs. All patents and publications 
referred to herein are incorporated by reference herein. 

As used herein, the term •'nucleic acid" refers to oligonucleotides 

10 or polynucleotides such as deoxyribonucJeIc add <ONA) and ribonucleic 
acid {RNA) as well as analogs of either RNA or DNA, for example, made 
from nucJeotide analogs, any of which are in single or doubla-stranded 
form and also RNA. NucJeic acid molecules can be synthetic or can be 
isolated from a particular biological sample using any number of 

tS procedures which are well-i(nown in the art, the particular procedure 
chosen being appropriate for the particluar biological sample. 

As used herein, nucleotides faiclude nucleoside mono-r di-, and 
triphosphates. Nucleotides also Include modified nucleotides such as 
phosphorothioate nucleotides and deazapurine nucleotides, A complete 

20 set of chain-elongating nucleotides refers to four different nucleotides 
that can hybridize to each of the four different bases comprising the DNA 
template. 

As used herein, a nucleic acid promoter-containing probe refers to 
a nucleic acid fragment that includes a double-stranded region encoding a 
25 promoter and a single-stranded region that contains at least 5 nucleotides 
at the 3'-end of the coding (sense) strand relative to the promoter. The 
at least 5 nucleotides are selected to be complemerrtary to a single- 
stranded region at the 3'-end of a nudeic acid containing the target 
nucleic acid of interest. Reference to the S^end refers to the antlsens 
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strand ff the target is contained in a doubte-stranded maiecule. When the 
target is double-stranded the resulting moJecule has rtioks fn the coding 
and non-ooding strands, 

AlttmatlvelY. the single-stranded region of the nucleic acid 
S promoter-contafning probe includes at least about 5 nucfeotides et the 5'- 
end of the noncoding <antisense) strand (relative to the promoter) that 
are complementary to at least about five nucleotides at the 5'-end of the 
coding strand of a target nucleic acid. Jn this embodiment, the target 
must be double-stranded. Following hybridization, the resulting 
10 moJecule has nicks in the coding and non-coding strands. 

In all embodiments, nicks can be ligated prior to transcription. As 
shown herein, however, ff the nicks are at a sufficient distance, 
preferably at least about +7 nucleotides, it is not necessary to ligate. 
As used here in. the target nucleic acid is the nucleic acid 
15 molecule to be secfuanced. It can be part of a larger moleude. The 
molecule that contains the target nucleic acid can contain or can be 
modified to contain at least about 5 nucleotides whose sequence is 
designed to be fully complementary to at least 5 contiguous nucleotides 
at the 3' end of the over hang an the promoter-containing capture probe 
20 (or 5' end depending upon the configuration selected). The target 
nudeic acid can be double-stranded, with a 3' or 5' single-stranded 
overhang , or single-stranded. RNA polymerases can and do transcribe 
single-stranded molecules, particularly, once the ternary complex is 
formed. The target may also be RNA or PNA (protein nucfeic acid formed 
23 by conjugating bases to an amino acid backbone; see. e^, Nielsen et 
aL {1 991 ) Science 254:1497). 

As used herein, nucleic acid synthesis refers to any process by 
which oligonucleotides or polynucleotides are generated, including, but 
not limited to processes invoking chemical or enzymatic reactions. 
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As used herein, a base-spec'ificallY terminated rlbonucieotides is 
one that generated during transcription by inoorporation a nucleotide tlnat 
results in transcription termrnation. Base-specif icaily terminating 
ribonudeoside triphophates, wliich produce base-specifically terminated 
5 rlbonucieotides, are icnown to tlioae of skill in the art. ExampJes of base- 
specifically terminating ribonudeoside tripliosphates include, but are not 
limited to: d'-deoxyrlbonucleoslde triphosphates, such as 3'-dGTP, and 
others described herein and Icnown to those of skill in the art. 

As used herein, complementary when referring to two nucleotide 

10 sequences^ means that the two sequences of nucleotides are capable of 
hybridizing, preferably with less than 25%, more preferably with less 
than 15%, even more preferably with tess than 5%, most preferably with 
no mismatches between opposed nucleotides. Preferably the two 
molecules will hybridize under conditions of high strvigency. 

15 As used herein: stringency of hybridization In determfnlng 

percentage mismatch is as follows; 

1) high stringency: 0.1 x 5SPE, 0.1% SDS, 65®C 

2) medium stringency: 0.2 x SSPE, 0.1 % SDS, 50«C 

3) low stringency: 1 .0 x SSPE, 0.1 % SDS, SO^'C 

20 It is undenstood that equivalent stringencies may be achieved using 
alternative buffers, salts and temperatures. 

As used herein, the term •array" refers to an ordered arrangement 
of members or positions. The array may contain any number of members 
or positions and can be In any variety of shapes. In preferred 

25 embodiments, the array Is two-dimensional and contains n x m members, 
wherein m and n are integers that can be the same or different. In 
partlculariy preferred embodiments, n and m are each 4 or a multiple 
thereof. 
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The term ''cross-linking agent"* is art-recognized, and, a$ used 
herein r refers to reagents taht txnmobiiize a nucleic acid to an Insoluble- 
support, preferably through covalent bonds. Thus, appropriate "cross- 
linking agents'" for use herein indudes a variety of agents that are 
S capable of reacting with a functional group present on a surface of the 
insoluble support and with a functional group present tn the nucleic acid 
molecule. Reagents capable of such reactivitv Include homo- and hetero- 
bif unctions I reagents, many of which are known in the art. 
Heterobifunctionai reagents are preferred. 

10 As used herein, The term "thiol-reactive functionality/' refers to a 

functionality that reacts rapidly or preferably with a nucteophilic thiol 
moiety to produce a covalent bond, such as a disulfide or thioether bond. 
In generel, thiol groups are good nucieophiles, and preferred thlol-reactlve 
functionalities are reactive electrophiles. A variety of thioUreactive 

15 functionalities are known in the art, and include, for example, hatoacetyis 
{preferably iodoacetyl), diazoketones, epoxy ketones, o, &-unssturated 
carbonyls (such as, a, (Jrenones) and other reective Michael acceptors 
{including malelmide), acid haltdes. benzyl halides, and the like. In 
certain embodiments, a free thiol group of a disulfide can react wrth a 

20 free thiol group, such as by disulfide bond formation, including by 
disulfide exchange. A '*thiol-reactiva' cross-linking agent, as used 
herein, refers to a cross-linking reagent (or surface) which indudas, or 
can be modified to Include, at least one thioi-reactive functionality. 
Reaction of a thiol group can be temporariiy prevented by blocking with 

25 an appropriate protecting group, as is conventional tn the art (see e,Q. , 
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T.W. Greene and P.G.M. Wuts "Protective Groups in Oroanic Synthasis," 
2nd ed. John Wiley & Sons, (1991)). 

As used herein, a selectively cleavabie linker is a Etnker that is 
cleaved under selected condltfons. such as a photccleavable linker, a 
5 chemicaJly cleavable linker and an enzymaticaNy cleavable linker (i^e-, a 
restriction endonuclease site or a ribonudeotide/RNase digestion^ The 
linker is interposed between the support and (mnnobilized DNA, 

As used herein, the terms *'proteln", "polypeptide" and " peptide*' 
are used interchangeably when referring to a translated nucleic acid, 

10 such as a gene product. 

As used herein, "senfiple" refers to a composition containrng a 
material to be detected, in a preferred embodiment the sample is a 
"biological sample^ which refers to any material obtained from e living 
source. Including but not limited to aramal, particularly mammals, 

IS including humans, pfent, bacteria, fungi, protrst and virus. The bblogicdl 
sample can be in any form, including solid materials^ such as tissue, cells 
and biopsied material, and biological fluids, including urine, blood, saliva, 
amniotic fluid, cerebral spinal fluid and mouth wash (containing buccal 
cells). Preferably solid materials are mixed with a fluid. 

20 As used herein, "substrata*' or "solid support" shaH mean an 

insoluble support onto which a sample is deposited according to the 
materials as described herein. Examples of appropriate substrates 
include beads, including but not limited to sliica gel, controlled pore 
glass, magnetic, dextran, agarose and ceilulose, cepillarids, flat supports 

25 such as glass fiber filters, glass surfaces, metal surfaces (steel, gold, 

sih^er, aluminum, copper and silicon), plastic materials Including multlwell 
plates or membranes (§ji^, of polyethylene, polypropylene, polyamide, 
polyvinylidenedlfJuoride}, pins , such as arrays of pins suitabJe for 
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combinatorial synthesis or analyste or beads in pits of flat surfaces such 
as wafers, pariticulariy silicon wafers, with or wfthout plates. 

As used herein, RNA polymerase refers to DNA-dependent RNA 
polymerases and RNA-dependent RNA polym^ses. Any RNA 
5 polymerase that recognizes a specif rc promoter sequence and is capable 
of initiating transcription and elongating a RNA transcript is contemplated 
within the soope of the term herein. Exemplary RNA polymerases that 
may be used in the methods provided herein Include, but are not limited 
to those obtained from: 1) archeabacteria, such as Halobacterium . 

10 Methanobacterium, Methanococcus. Sulfolobalej^ and Thermoplasma: 2) 
eubacteriae such as gram negative bacteria, e^, Escherichia coll and 
strains of Salmpp^lja and ShioeHa. gram positive bacteria, Bacillus 
SiilsSM end Staphjopopffus aureus: 3) bacteriophages, 3uch as T7, T3, 
SP6, SP6 nicked and N4; 4) DNA viruses: 5) RNA viruses, such as 

15 Influenza vJrus; 6t plants, such as wheat; and 7) eulcaryotic RNA 

polymerase II isolated from fungii, jLflu, Saccharomyc^jt cerevisae and 
higher eukaryotcc organisms, mammals- Also included within the 
scope of the tarm RNA polymerase as used herein is the RNA phage 
replloase (see, e^ International PCT application No.PCT/US87/00880 

20 and 5,270,184). Eulcaryotfc RNA polymerases, such as those isolated 
from eukaryotic tissues, incfuding plants and animals, that recognize 
eukaryotic promoters, including viral promoters from viruses that infect 
eukaryotic cells, are also contemplated herein. The T7, T3, and SP6 
RNA poiymerases have been ctoned and their sequences and methods for 

25 their purification as well as for purification of the bacterial RNA 

polymerases and certain eukaryotic DNA dependent RNA polymerases 
are known. In addition, numerous RNA polymerase commercially 
available in large quantities, and can also be readily produced from 
commerically available vectors. The sequences of the promoters for ach 
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of the above-noted polymerases and other RNA polymerase ere also 
known to those of skilJ in the art. 

As used herein, a promoter region refers to the portion of DNA of 
a gene that contrQis expression of DNA to which It Is operatively Jinked. 
5 The promoter region includes specify sequences of DNA that are 
sufficient for RNA polymerase recognition, binding and transcription 
initiation. This portion of the promoter region is referred to as the 
promoter* In addition, the promoter region includes sequences that 
modulate this recognition, binding and transcription initiation activity of 

10 the RNA polymerase. These sequences may be cts acting or may be 
responsive to trans acting factors. Promoters, depending upon the 
nature of the regulation, may be constitutive or regulated. A constitutiva 
promoter is always turned on. A regulatafafe promoter requires specific 
signals to be turned on or off. A develcpnrientally regulated promoter is 

15 one that is turned on or off as a function of developmant 

The promoter can be of a consensus sequence or variant. When a non- 
witd-type promoter is used, transcription will occur at a rate sufficient to 
produce a detectable transcriptr and is typlcaUy at least about 5-10% of 
ihe rate at which transcriptron would have occurred If a wild-type or 

20 native pronioter had been used by the RNA polymerase to transcribe the 
nucleic acid in vitro. Each species of RNA polymerase Is speciflc 
for a particular pronioter sequence. Each of the three bacteriophage 
roiA polymerases {T7, T3, and SP6) has its own specific DNA promoter 
sequence. After the RNA polymerase has attached to the promoter 

25 sequence of the template and transcription has been initiated, the 

promoter sequence releases the RNA polymerase. The RNA polymerase 
then continues transcription along the template until (1) it runs out of 
RNA starting materials, (2) the transcriptton reaction is terminated by a 
partbular nucleotide, nucleosrde or analog, {31 transcription is 
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spontaneousfy aborted whan the RNA polymerase and the template 
prematurely disassociate or (4) the end of the DNA template is reached. 
As used herein, a "promoter-containing nucleic acid" is a nucleic 
acid that contains a sequence of nucleotides that directs the site-specific 
5 binding of an RJVIA pciymerasa molecule , which binds and melts the 
double-stranded DNA to form an open transcription rnitration complex 
that is capable of rnitiating RNA synthesis in the presence of 
ribonucleotide triphosphates. Thus, a functional promoter sequence is a 
sequence of nucleotides to which a DNA dependent RNA polymerase 

10 binds, meits in. forms an rnitiatlon complex and initiates transcription. 
The promoter-containfng nuolerc acid capture probe refers to the double- 
stranded molecule the includes the promoter region and a 3' overtiang of 
at Jeast about 5 nciclectides to which the target nucleic acid hybridizes, 
particularly under high stingency conditions. Typicalfy such hybridization 

15 requires about 5 contiguous base pairs of identity between the target and 
promoter-containing capture probe. After Initiating transcription, the 
RNA polymerase (or subunits thereof) is part of ternary complex 
containing the polymerase, the DNA, and the nascent RNA transcript. 

As used herein, a "coding strand " refers to the nucleic acid strand 

20 of a promoter-contalnlng nucleic acid that has the same polarity as a 
corresponding mRNA molecule initiated from that promoter. Thus, It 
refers to the sense strand. The non-coding strand is the enti-sense 
strand that is transcribed by RNA polymerase. 

As used herein "native nucleoside (or ribonucleoside) triphosphate" 

25 or "native nucleotide" or ribonucleotide triphosphate means any one of 
the naturally occurring normal precursors of RNA, i,e., ATP, GTP, CTP, or 
UTP. A "nucleoside triphosphate analog" or'nucleotide analog" means 
any nucleoside triphosphate analogous to native nucleotide, but lhat 
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contafns one or more chemical modifications compared to the native 
nucleoside or nucleotide. 

As used herein, a "matrix material'' refers to a material used in 
mass spectrometry that is a proton donating, UV absorbing material* 
5 usually an organic acid, that forms crystalline matrix-nucleic acid 

structures that are readrly lonizable during MALDI. An exemplary matrix 
material is a solution of 3-hydroxypiGolinlc acid {3-HPA, 0.7 M in 50% 
acetonitriler 10% ammonium citrate). 

As used herein, "conditioning" refers to a process whereby prior to 

10 mass spectrometric analysis, the nucleic acid to be analysed is modified, 
for example to decrease the laser energy required for voJetization and/or 
to minimize fragmentation. Conditioning can be performed on a nucleic 
add, preferably after immobilization. An example of conditioning ie 
modification of the phosphodJester backbone of the nucleic acid molecule 

tB by, for example, cation exchange), eliminates peak broadening due to a 
heterogeneity in the cations bound per nucleotide unit. Contactrng a 
nucleic acid molecule with en alkylating agent such as alkyllodide, 
iodoacetamide, -iodoethanoJ, or 2,3-epoxy-1-prDpanol, the monothio 
phosphodiester bonds of a nucleic acid molecule can be transformed Into 

20 a phosphotriester bond. Likewise, phosphodiester bonds may be 
transformed to uncharged derivatives employing trialkylsilyl chlorides. 
Conditioning also can be performed by incorporating modified nucleotides 
into a nucleic acid during synthesis. Such modified nucleotides can be 
used to reduce sensitivity for depurination {fragmentation during MS), 

25 sucg as with a purine analog such as N7- or N9-deazapurine 

nucleotides or ribonucleotide. Modified nucleotides also can include RNA 
building blocks, oligonucleotide triesters, phosphorothioate functions 
which ere afkylated, oroligonud'Otide mimstics such as PNA. In 
preparing conditioned RNA transcripts with modified nucleotides, the 
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RNA polymerase chosen must be able to incorporate the modified 
nucleotide into the RNA transoript. 
IVIethods for sequencing 

Mass spectrometric methods of sequencing nucleic acids are 
5 provided. In particuJar, the sequencing methods use nucleic acid 
promoter-containing probe that contain a double^randed region 
encoding a promoter and a single-stranded region for hybridizing to a 
complementary region on target nudeic acids. The nucleic acid 
promoter-containing capture probe is preferably vnmobizsd on a solid 

10 support. The single-stranded region is hybridized to a complementary on 
a target nucleic acid, which comains the nucleic aoid to be sequenced 
and at least 5 nucleotkles of single-stranded region identical to the 5 
contiguous single-stranded nucleotides on the probe. 

Following hybridization of the target to the nucleic acid promoter- 

15 containing probes, the nucleic acid sequence Is determined by generating 
a set of nested base-specific chain terminated RNA transcripts initiated 
from the promoter and transcribed through the target nucleic add. The 
resulting nested RNA transcripts ara analyzed using mass spectrometry. 
While ONA is often the preferred vehide for sequencing, the 

20 methods herein are sequence RNA transcripts. RNA fragments are more 
stable during matrlx-assfsted Jaser desorptlon/ionlzation <MALDI) mass 
spectrometry than DIMA fragments. Without being bound by any theory, 
the enhanced stability may result from the presence of the 2'-hydfoxvl 
group on the sugar moiety of RNA. which helps to reduce depurination 

25 during the MALDl process. 
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RNA poiymerase* ftnd promotars 

Any RNA polymerase capable of directing the In vitro transcription 
of an RNA molecule is contemplated for use in the niethods described 
herein. Preferred RNA polymerBses are DNA-dependent RNA 
5 polymerases. iMethods for isolating and purifying RNA polymerase 
molecules are well known to those of skBI In the art: Q/? replicase (seee, 
e^, U.S. Patent No. 5,686,249. Re: 35,443 and Eoyang sllL. (1971) In 
Procedures in Nucfeic Acid Researnh Cantoni and Davles, eds., Volume 
2, pp. 829-839, Harper and Rowe, NY); bacteria (e.g. . t fifiii. see 

10 Burgees and Jendrisak 11 976) Bioehemifi^/w J4J4634-4638 and Hager et 
SL {1 990)Si2fihsmistDi 23:7890-7894: JjishmaoH. afli, see Sadhukhan 
gi IL <1 997} Mol. CaH. BInfthPnn J2i: 105-1 14j Baoillus subtilis . 
Qlacomoni (1980) IUL.JjJiflfiiiefli,lflS;579-591); phage {a^a^ T7, 
McDonneil si aJ, (1 977) J. Mol. Bl»l §3:71 9-736 and Studier el aL 

15 11969) Viffilaay 28:562-574; see also HefiiaL 11997) Protein Exor. 
fyiiftS: 142-151); viruses (tju» rhabdovirus, see Das et aT (1996) 
Methods gniviT|f>|, ^7?^-flfl.i turn^ mosiae virus, see Oeidman si sL 
(1997) si. VirBlMflth,M:184-195; vacctnla. Gershon ej (1 996) 
Methods EnzyiTioi. aZ&:35-57); and mammaUan {g^, yeast polir, Koteske 

20 filfiL (1996) Mgtijod9jna2D2L 273:1 76-184; human pollJ. Maidonado 
SiaL (1996) MMhods^BizymoL 274:72-100). in addition, a number of 
prokaryotic. eukaryotw, bacteriophage and viral RNA polymerases, such 
as T3. T7 and SP6 are commercially available <ejj.. sold by Stratagena, 
La Jolla, CA; Boehringer Mannheim. Indlanapons, IN; Pharmacia. Uppsala, 

25 Sweden; and Sigma Chemical Corp, St Louis, Mo) and/or well known. 
The promoters are also well known and readily available (e.g.. the pGEM 
series of plasmlds are available from Promega, JMadison Wi: see also, 
U,S, Patent Nol 4,766.072, which describes construction of the pGEM 
piasnrdds). 
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In practicing the methods, DMA-dependent RNA poiymerases and 
RNA-dependant RNA polymerases may be used. For exan^le, RNA 
polymerases that may be used in the methods provided herein include, 
but are not limited to those obtained from: 1 ) archeabacteria, such as 
5 Haiobacterium, Methanobactarftim Methanoe^peus. SuHolobalas end 
Therrnoplasrna; 2) eubacteria, such as gram negative bacteria, e.g. . 
Escherjghl? £06 and strains of Salmonella and Shjoalla. gram positive 
bacteria, fi.fl., Baeiliiis subtills and Staphlococeus aureus: 3) 
bacteriophages, such as T7. T3, SP6 end N4: 4-) DNA viruses; 5) RNA 

10 viruses, such as influenza virus; 6) plants and plant viruses, such as 
wheat and tumlp mosiac virus; and 7) eukaryotie RNA polymerase II 
isolated from fungf, Saccharomvcftft cgifij^isgg and higher eukaryotie 
organisms, mammals for a review see in RNA PQlvmerMe anri 
Th? P^qtilation ftf Tr^npcrlpTion, Retinl^off aU, eds, Elsevier. NY). Also 

16 included for use herein Is the QjJ replicase from the Qfi RNA phage (Sifl^ 
see U.S. Patent Noa. 5.670.353, 5.696,249 and Re: 35.443). 

The selection of the appropriate RNA polymerase for a nucleic acid 
template to be sequenced is within the skill of the skilled artisan and 
varies according to the nucleic acid molecule to be sequenced and the 

20 selected promoter region. The selection may be determined emplricslly 
following using teachings known to those of skill in the art, including 
those described herein. Preferred nucleic acid molecules to be 
sequenced are DNA molecules. RNA and, particularly, PNA (protdn 
nucleic eoidsj are also amenable to sequencing, upon selection of an 

25 appropriate RNA polymerase. 

Each nucleio acid promoter-containing probe used In the 
sequencing methods described herein contains a promoter. The 
promoters used In the methods herein can be obtained from any source 
or can be prepared synthetically based upon sequences recognized by 
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the selected RNA polymerase. For e^cample, the nucleic acid containing a 
promoter may be obtained directly from a variety of different organisms, 
such as bacteria, viruses, phage end eukaryotic arganisms, by cloning or 
may be obtained from commercially available expression vectors le.g, , 
5 T7, Ta, SP6 and Ap^ and Ap^ promoters from Boehringer Mannheim and 
Phamwcia; bte or (ac promoters, RSV-LTR promoter and F9-1 promoter; 
Stratagene). The selection of the appropriate promoter will depend on 
the nucleic acid to be sequenced, sequencing conditions, and most 
Importantly, on the RNA polymerase selected for trsnscriptron. 

10 Promoters can be produced by hybridizing Qligonucleotldes 

encoding the coding and non-coding DNA strands of a promoter <5ee 
Example 4), The separate strands of nucleic acid ere obtained preferably 
by nucleic acid synthetic techniques. Oligonucleotides used to assemble 
a promoter are designed such that foltowing hybridization, there exists a 

15 single stranded region at the 3'-end of *e coding strand to which target 
DNA can hybridize. 

Immobilization of nuclete acid promoter-contafnlng probes 
In preferred embodiments, the nucleic ecid promoter-containing 
probe b immoUrizedr directly or by means of a cross^linking agent to a 

20 solid support provided herein. The probe can be immobilized prior to 
hybridization or subsequent to hybridization to the target. Preferably and 
most conveniently hybridization Is effected foHowing immabilization to 
the solid support. The probe ]$ preferably immobilized via the 5' end of 
the coding strand or the 3' end of the non-ceding strand. 

25 In other embodiments, a promoter and a downstream target 

nucldc ecid can be isolated as a transcriptional unit from genomic DNA 
by amplification with primers or hybridization cloning technology, DNA 
encoding the promoter and target can then be immobilfzed onto a solid 
via the terminal nucleotides of the 5' end of the coding strand (sense 
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fitrandt or the 3'-end of the non-coding &trand» As discussed further 
beloWp a Jinking group can be rncorporated into PGR prrmers used to 
isolate the promoter-containing nucleic acid template to achieve directed 
immobilization to a support Use of a suitable RNA polymerase that 
5 recognizes the pron(K>ter, will in the presence of chain terminating 
nucleotides will result In a set of nested fragments from which the 
sequence can be determlrrad^ 

Preferred solid supports are those that can support linkage of 
nucleic acids thereto at high densrties, preferably such that the 

10 GovaJently bound nucleic acids are present on the substrate at a density 
of at least about 20 fmol/mm'^ mora preferably at least about 75 
fmoJ/mm^ still more preferably at least about 85 ftnol/mm^ yet more 
preferabFy at least about 100 ^mol/mm^ and most preferably at least 
about 150 fmol/mm^. Among the most preferred substrates for use jn 

15 the particular methods of Immobilizing nucleic acids to substrates 
provided herein Is silicon, whereas less preferred substrates Include 
polymeric materials such as polyacrylamide. Substrates for use in 
mettiods of producing an^ays provided herein include any of a wide 
variety of insofuble support materials including, but not limited to silica 

20 gel, controlled pore glass, cellulose, glass fiber filters, glass surfaces, 
metal surfaces (steel, gold, silver, aluminum, silicon and copper), plastic 
materials (g^cL, of polyethylene, polypropylene, potyamide, polyvinyldene- 
dlfluoride) and siitcon. 

fn embodiments of the methods in which a cross-lfnking reagent is 

25 not employed, a modified nucleic acid Is reacted directly with a 

approprfately functlonalized surface to yield immobilized nucleic acid. 
Thus, for example, an iodoacetyl-modified surface (or other thiol-reactive 
surface functionality) can react with a thioi-modffied nucleic acid to 
provide immobilized nucleic acids. 
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In embodiments using a cross-linking agent, the crass-linking agent 
is Gelected to provide a high density of nucleic acids immobilrzed on the 
insoluble support. The cross-linking agent fend other reagents used to 
f unctionalize the support surface or the nucleic acid molecule) can be 
5 selected to provide any deatred spacing of the immobilized nucleic acid 
molecules from the support surface, and to provide any desired spacing 
of the immobilized nucietc acids from each other. Thus, stertc 
encumbrance of the nucleic acid molecules can be reduced or eliminated 
by choice of an appropriate cross-linking agent. In certain embodiments, 

10 the cross-linking reagent can be selected to provide multiple reactive 
functionalities as used in dendrimer synthesis for attachment of multiple 
nucieic acids to a ^ngle cross-linking moiety. Preferably, the cross- 
linking agent is selected to be highly reactive with the nucleic acid 
molecule, to provide rapid, complete, and/or selective reaction* h 

15 preferred embodiments, the reaction volume of the reagents <e>g,, the 
thiol group reagent and the thioi-reactive functionality) is small 

Modified nucleic acid promoter-containing probes and linkers 
Preferred nuctelc acid promoter-containing probes for use herein 
are "thioJ-modifled nucleic acids," i-e., nucleic ecids derivatized to 

20 contain at least one reactive thiol moiaty. As described in further detail 
fn Example 1 , below, nudeic adds containing at least one reactive thiol 
are preferably made by treating a nucleic aoid containing a 3' or 5' 
disulfide with a reducing agent, which preferably wilf not compete in 
subsequent reactions <i,e. will not react virith an iodoacetimido 

25 Functionality). Disulfide-derivatized nucleic acids can be synthesized 
according to a variety of methods. For example, a nudeic acid can be 
modified at the 3'- or 5'-terminus by reaction with a disuEfide-contalnrng 
modifying a reagent. Alternatively^ a thiolated primer can by 
enzymaticaliy or non-enzymaticaily attached to the nucleic add. A 6'- 
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phosphoramidate functionality can also provide an attachmoirt point for a 
thiol or disuilide-contalning cytosine or deoxycytoslno. Examples of 
reducing agents appropriate for reduction of a disulfide-modifiad nucleic 
acid include: tris-(2-carbaxyethyl)phosphlne (TCEP) (preferably a 
5 concentration in the range of 1-1 00 mM (most preferably about 10 mM)} 
i» reacted at a pH In the range of 3-6 (most preferably about 4.6), a 
temperature in the range of 20-45 "C {most preferably about 37*0) for a 
time period in the range of about 1 to about 10 hrs (most preferably for 
about 6 hrs); dithiothreitoi (preferably a concentration In the range of 25 

10 to 100 mM (depending on whether the reactant is Isolated i is reacted at 
a pH in the range of 8-10 (most preferably about 8) and at a temperature 
in the range of 25-45'»C (most preferably about aT^C)) for a time in the 
range of about 1 to about 10 hrs (nnost preferably about 5 hrs). TCEP 
provides an advantage in the low pH at which It is reactive. This low pH 

15 effectivdy protonates thiols, thus suppressing nucieophrJic reactions of 
thiols and resulting in fewer side reactions than with other disulfide 
reducing agents which are employed at higher pH. 

As further described in Example 1, below, a preferred bifunctionai 
cross-nnking agent is N-8UCCinimidyI(4-iodacety|) aminobenzoate (SIAB). 

20 Other crossllnking agents include, but are not limited to, dimaleimfde, 
dithio-bis-nitrobenzoic acrd (DTNB), N-suoclnlmidyl-S-acetyl-thioacetate 
(SATA), N-succlnlmidvl-3-(2-pyridyldithiQl propionate (SPDR, 
sucdnrmidyl 4-(N-malelmldomethyl)cvclohexan6-1-carboxylate (SMCC) 
and 6-hydrazinomcotimide (HYNIC) may also be used in the novel 

25 process. For further examples of cross-linking reagents, see, Wong 
'Chemistry of Protein Coniuaation and Ckirr-I inifing " CRC Press 
11991), and Hermansen, " Bioconiuaate Technir|iii=.ft ' Academic Press 
(1895). 
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\n other embodiments, the nucleic acid is immobilized using a 
photocleavdble or photolebile linker moieties that i$ cleaved during mass 
spectrometry. Exemplary photolabiie cro^s-linlcBr include, but are not 
finuted to, 3-am'rno-l2-nitrophenyl|propionic acid (Brown ej aL I19&5) 
b Molecular Diveraitv. pp.4-1 2 and Rothschild et aL (1 996) Nucfeic Acids 
Res. 24:361-66) See, also International PCT application No. 
{WO 98/20019). 

A nuclejc acid promoter-containing probe can be directty Jinked to 
a solid support via a reversible or irreversible bond between an 

10 appropriate functionality (L') on the target nucleic acid molecule (T) and 
an appropriate functionaltty IL) on the capture molecule. The format is 
similar to that shown in U.S. Patent No. 5,503,980. A reversible Irnkage 
includes linkages that are cleaved under the conditions of mess 
spectrometry. These include, but are not limited to photocleaveble 

15 bonds, such as a charge transfer complex, and labile bonds formed 
between reJatively stable organic radicals. 

Furthermore, the linkage can be formed with L' being a quaternary 
ammonium group, in which case, the surface of the solid support, 
preferably, carries negative charges to repel the negatively charged 

20 nucleic acid backbone, thereby facilitating the desorption required for 
analysis by a mass spectrometer. Desorption can occur either by tha 
heat created by the laser pulse and/or, depending on L/ by specific 
absorption of laser energy which is in resonance with the L' 
chromophore. 

25 Thus, the L-L' chemistry can be of a type of disulfide bond 

{chemically cleavable, ejg., by mercaptoethanol or dithfoerythrol; a 
blotln/streptavidin system; a heterobifunctional derivative of a trityl ether 
group: see, gjt., Kfister et aL (19901 ''A Versatile Acld-Labiie Unker for 
Modification of Synthetic Blomoieoules," Tetrahedron Letters 31 !7Q9S> 
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that can be cleaved under mifdiy addic conditions as well as under 
conditions of mass spectrometry, a levulinyl group cleavable under 
almost neutral conditions with a hydrazinium/acetate buffer, an 
afginine-arginine or lysine-lysine bond cleavable by an endopeptidase 
5 enzyme tike trypsin or a pyrophosphate bond cleavabia by a pyrophos- 
phatase, or a ribonucleotide bond in between the oligodeoxynucleotide 
sequence, which can be cleaved, for examptep by a ribonuclease or alkali. 

The functionalities, L and L/ can also form a charge transfer 
compJex and thereby form the temporary L-L' linkage. Since in many 

10 cases the "charge-transfer band " can be determinad by UV/vis 
spectrometry (see, e^, Organic Charge Transfer Complexes by R. 
Foster, Academic Press. 1969), the laser energy can be tunad to the 
corresponding energy of the charge^transfer waveJength and, thus, a 
specific desorption off the solid support can be initiated. Those skilled in 

15 the art wiH recognize that several combinations can serve this purpose 
and that the donor functionality can ba either on the solid support or 
coupled to the nucteic acid molecule to be detected or vice versa. 

in another approach, a reverable L-L' linkage can be generated by 
homolytically forming relatively stable radicals. Under the influenca of 

20 the laser pulse, desorption (as discussed above) as well as ionization will 
take place at the radical position. Those skilfed in the art will recognize 
that other organic radicals can bo selected and that, in relation to tlie 
dissociation energies needed to homoiyticaily cleave the bond between 
them, a corresponding laser wavelength can be selected (see e.g. , 

25 Reactive IVIoleculefi by C. Wentrup, John Wiley & Sons, 1 984). 

As noted, several embodiments of the method Involving of 
inrkmobilrzation are contemplated herein. These include, but are not 
limited to embodiments in which: 1} the primer nucleic acid is 
tmmobliized to the solid support and the target nucleic 8c?d is iiybridized 
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thereto to form a promoter sequence; 2) a double-stranded nucfeic acid 
encoding a promoter (amplrfied or isofated) is immobilized through linkage 
to one predetemnined strand, and in v|ti:& transcription is initiated in the 
presence of a predetermined deoxy ribonucleotide, and 3) the target 
5 nucleic acid, which cam be amplified where there is available sequence 
rnformation. is hybridized to a double-stranded nucleic acid capture probe 
with 3' overhang, and the resulting hybrids are immoblGzedt 

in the embodiments where the primer nuclefc acid is immobiiized 
on the solid support and the target nucleic acid is hybridized thereto, the 

10 inclusion of a cleavaMe linker permits the primer DNA to be immobiiized 
at the 5'-end so that free 3'-0H is availabie for "hybridizing" target DNA 
to the free DNA strand and initiating transcriptfon to 

Any linker known to those of skifl fn the art for rmmobilizing 
nucleic acids to solid supports may be used herein to link the nucleic acid 

15 to a solid support. The preferred linkers herein are the selectively 
deavable linkers, partioulariy those exemplified herein. Other linkers 
include, acid cleavable (inkers, such as bismaleimideothoxy propane and 
acid-labile trityl linkers. Acid cleavable linkers, photocleavable and haat 
sensllSve linkers may also be used, particularly where it may be 

20 necessary to cleave the targeted agent to permit it to be more readily 
accessible to reaction. 

Acid deavable linkers 
Acid cleavable linkers include, but are not limited to, brsmafeimi- 
deothoKy propane; and adipic acid dihydrazide linkers (see, e^a-, Fatrom 

25 et SL (19921 infection & Immun fiQ;RBil.i;flQ) and acid labile transferrin 
conjugates that contain a sufficient portion of transferrin to permit entry 
Into the intracellular transferrin cyclrng pathway (see, e.g,, Welhdner et 
SL 09911 J. Biol. Ch8rn> 266:43Q9.A31 A) 
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Photocleavable Linkers 
Photocleavable linkers are linkers that are cleaved upon exposure 
to light (see, QoWmacher gl aL (19921 Blocont. Chait), 3; 104^1 07). 
thereby releasing the targeted agent upon exposure to light. 
5 Photocleavabie linkers that are cleaved upon exposure to light are known 
(see, 2^. Hazum filaL <1981) in Peot.. Proe. gur. Peat. SvirD. Ifith 
Brunfeldt« K |Ed), pp. 105-1 10, wWch describes the use of a nltrobenzyl 
group as a phatocleavabie protective group for cysteine; Yen §t 
(1989) Mekremol. Chem ian;fifl-p-? which describes water soluble 

10 photocleavabie copolymers, including hydroxypropylmethacrytamlde 
copolymer, glycine copolymer, fluorescein copolymer and 
methylrhodamlne copolymer; Qoldmacher aLd 992} Bioconi. Chem. 
3:104-107, which describes a cross-linker and reagent that undergoes 
photolytic dBgradatlon upon exposure to near UV Kght (350 nm>; and 

15 Senter M aL d 985) Photocham. Phntnhjy^i £2:231 -237, whteh describes 
nitrobenzyloxycarbonyl chloride cross linking reagents that produce 
pliotocleavable linkages), thereby releasing the targeted agent upon 
exposure to light. In preferred embodiments, the nucleic acid is 
immobilized using the photocleavabie linker moiety that is cleaved during 

20 mass spectrometry. 

ChemicaOy oleavable finksra 
A variety of chemically cleavable linkers may be used to introduce 
e cleavable bond between the ImmobiBzed nucleic acid and the sond 
support. Aoid-labfle linkers are presently preferred chemically cleavable 

25 Rfikers for mass spectrometry, espedally MALDl-TOF MS, because the 
acid labile bond is cleaved during conditioning of the nucleic acid upon 
addition of the 3-HPA nrjatrix solutton. The acid labile bond can be 
Introduced as a separate linker group, e^ the acid labile trityl groups or 
may be incorporated in a synthetic nucleic acid linker by introducing one 
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or more silyl Internudeoside bridges using dtisopropylsiiyi, thereby 
forming diisopropyteayMlnked oligonucleotide analogs. The 
diisopropylsUyl bridge replaces the phoshodiester bond in tlie DNA 
backbone and under mildly acidfc conditions, such as 1.6% 
5 trifluoroacetic acid (TFAl or 3-HPA/T % TFA MALDI-TOF matrix solution, 
results in the introduction of one or more intra-strand breaks in the DNA 
molecuie. Methods for the preparation of dlisopropyisilyl-linked 
oCgonucleotide precursors and analogs are known to those of si^iJf in the 
art (see e^, Saha et aL (t993} J, Org. Chem. £8:7827-7831), These 

10 aligonucieotide analogs may be readily prepared using solid state 
oligonucleotide synthesis methods using diisopropylsityl derivatized 
deoxyribonucleosides. 
Modification of nucJeic ecids 
A, Mass modlficatfon 

15 in certarn embodiments, nucleic acids modified at positions other 

than the 3*- or 5'- terminus can be used. Modification of the sugar 
moiety of a nucleotide at positions other than the 3' and 5' position rs 
possible through conventional metiiods. Also, nucleic acid bases can be 
modified, sua., as described in F. EcksteiOr ed., " Olfaonucieotides and 

20 Anatoques! A Praeticai frpprn»rh |rl press (1991). Such a linker arm 
can be modified to include a thiol moiety. Alternatively, backbone- 
modified nucleic acids can ba used so that the thfol group can be 
attached to the nitrogen center provided by the modified phosphate 
backbone. 

25 In preferred embodiments, modification of a nucleic actd, such as 

described above, does not substantially Impair the ability of the nucleic 
acid molecule to hybridize to its complement. Thus, any modificatron 
should preferably avoid substantially modifying the funqiionalities of the 
nucleic acid that are responsible for Watson-Crick base pairing. The 
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nucleic acid can be modified euch that a non-terminal thiol group rs 
present, and tine nucleic acid, when immobilized to the support, is 
capable of self-complementarv base pairing to form a "hairpin" structure 
having a duplex region. 
5 B. Other modifying RNA analogs 

In practicing the methods described herein, a sat (or sets) of 
nested base-specific chain terminated RNA transcripts is (are) generated 
during transcription by the incorporation of a modified base-specific chain 
terminating ribonucleotide analog {or specific cleavage by RNases in a 
10 modified JVlaxam-GHbert strategy). Any ribonucJeoside triphosphate 
analog that results In the sequance-speoific arrest of transcription 
elongation upon incorporation Into an RNA molecule by an RNA 
polymerase may be used m the methods herein. Presently preferred 
ribonucleotide analogs are 3'-deoxyrlbonucleotidBs. result In base- 
IB specific termination of transcription (e.g.. see Axelrod et al. (1985) 
pipffherTilstn t 21:5716-6723; Tyagarajan fit al, (1 98&) BtochBmistrv 
2Q:1 0920-1 0924). 

Ir> certain embodiments, in addition to a base-specific chain 
terminating ribonucleoslde triphosphate analog, additional ribonucleotide 

20 analogs can be added to reduce the secondary structure of the resulting 
RNA transcript end/or to prevent premature transcription termination or 
pausing or RNA cleavage. For example, the incorporation of riboinosine 
using inoslne 5'-triphosphate is known to reduce the secondary structure 
of RNA products. In the presence of a dinucleotide guanine initiator, 

25 fnoskie 5'-triphosphate can effectively substitute for GTP in in vitro 

transcription reactions le^g^ Axalrod ej sL (1985) &3si3Sn^Sl£iL M'^57^ 6- 
5723). 

In addition, modified ribonuclecttde analogs may be added to the 
transcription mixture to increase the efficiency of transcriptional 
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termination and/or transcript release to promote and facilitate tha rate 
enzyme turnover. For example, the addition of 4-thjo UTP, S-bromo UTP, 
5-iodo CTP alter the hydrogen bondhg of the nucleic acid facilitating, at 
Jeast with some RNA polyn>6rase$, transcriptional termination and 
B transcript release. 

Solid supports and substrates 

Examples of Insoluble supports and substrates for use herein 
include, but are not llmitad to: beads, such as silica ge\. controlled pore 
glass, magnetic beads, dextran and agarose (Sephadex and Sepharose) 

10 beads, cellulose beads and othens, capillaries, flat suppons such as glass 
fiber filters, glass surfaces, metal surfaces (steel, gold, sHver, aluminum, 
silicon and copper), plastic materials mcluding muitiwell plates or 
membranes (fljL, of polyethylene, potypropylane, polyamide. 
polyvJnyWenedlfloorlde}, wafers, combs, pins ( e.g.. arrays of pins 

15 suitable for combinatorial synthesis or analysis) or beads in pits of flat 
surfaces such as wafers l e.a.. silicon wafers), with or without filter 
plates. 

Mass spectrometry 

Once transcription Is complete, the RNA transcripts can be 

20 analyzed by any of a variety of means Including, for example, 
speotromatric techniques such as UV/VIS, IR, fluorescence, 
ohemliumineseanee, or NMR spectroscopy, mass spectrometry, gel 
electrophoresis and other methods known Jn the art, or combinations 
thereof. Mass spectrometry is preferred herein. Mass spectrometer 

25 formats include the Ionization (1) techniques, such as matrix assisted 
laser desorption (MALDI), continuous or pulsed electrospray I ESI) and 
related methods (e-g, lonspray or Thermospray), and massive cluster 
impact |I«CI): these ion sources can be matched with detection formats 
including linear or r flectron time-of-fnght (TOF). single or multiple 
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quadruplBi single or multipJe insgnetic sector, Fourier Transform ion 
cyclotron resonance (FTICR), ion trap, and combinations thereof to yield 
a hybrid detector (j^EL/ fen-trap/time'Of-fJIght). For Ionization, numerous 
matrix/Mravelength combinations {MALOD or solvent combinations {ESI) 
5 can be employed. 

Preparation of ONA arrays 

In preferred embodiments, nucleic acJd promoter-containing 
probes, prior to or subsequent to hybridization to target nucleic acids, are 
immobilized at to the surface of a solid support in an array format. 

to Particularly suitable metliods for forming these DNA arrays are those 
described herein and U.S. application Serial Nos. 08/746.053, 
08/787,639 and 08/786.988. The subject matter of each pending U.S. 
applications is herein incorporated by reference in its entirety and is 
summarized as follows. 

15 Figure 1 iUustrates one system for preparing arrays of sample 

nttterial for andysis by a diagnostic tool. Fig. l depicts a system 10 
that Includes a data processor 12, a motion controller 14, a robotic arm 
assembly 16. a monitor element 18A, a central processing unit 188, a 
microliter plate of source material 20, a stage housing 22, a robotfc arm 

20 24. a stage 26, a pressure controller 28, a conduit 30, a mounting 

assembly 32, a pin assembly 38, and substrate elements 34. in the view 
sliown by Fig. 1 , it is also illustrated that the robotic assembly 16 can 
inokide a moveable mount element 40 and a horizontal slide groove 42. 
The robotic ann 24 can optionally pivot about a pin 36 to Increase the 

25 travel range of the ami 24 so that arm 24 can disposes the pin assembly 
38 above the source plate 20. 

The data processor 12 depicted in Fig, 1 can be a conventional 
digital date processing system such as an IBM PC compatible computer 
system that is suitable for processing data and for executing program 
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instructions that will provide information for controliing the movement 
and operatron of tha robotic assembiy 1 6. The data processor unit 1 2 
can be any type of sy&tem suitable for processing a program of 
instructions signals that will operate the robotic assembly that is 
5 Integrated into the robotic housing 1 6. Optionally the data processor 12 
can be a mfcro-controlled assembly that is integrated into robotic housing 
16. In further alternative embodiments, the system 10 need not be 
programmable and can be a single board computer having a ftrmwere 
memory for storing instructions for operating the robotic assembly 1 6, 

10 In the embodiment deptcted in Rg. 1, there is a controller 14 that 

electronically couples between the data processor 1 2 and the robotic 
assembly 16. The depicted controller 14 Is a motion controller that 
drives the motor elements of the robotic assembly 1 6 for positioning the 
robotic arm 24 at a selected location. Addrtfonally, the controller 14 can 

15 provide instructions to the robotic assembly 1 6 to direct the pressure 
controller 28 to control the volume of fMd ejected from the individual pin 
elements of the deleted pin assembiy 38 » The design and constmction 
of the depleted motion controller 14 foilow^s from principles well known 
in the art of efeotrlcal engineering, and any controller element suitable for 

20 drwing the robotic assombry 16 can be practiced without departing from 
the scope thereof. 

The robotic assembly 16 depicted m Rg. 1 electronically couples 
to the controller 14. The depicted robotic assembiy 16 is a gantry 
system that includes an XY tabte for moving the robotic arm about a XY 

25 plane, and further includes a Z axis actuator for moving the robotic arm 
orthogonally to that XY plane. The robotic assembly 1 6 depicted in Fig. 
1 Includes an arm 24 that mounts to the XY stage which moves tha arm 
within a plane defined by the XY access. In the depicted embodiment, 
the XY table is mounted to the 2 actuator to move th entire table along 
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the Z axis orthogonal to the XY plane. In this way, the robotic assembly 
provides three degrees of freedom that aliows the pin assembly 38 to be 
disposed to any location above the substrates 34 and the source plate 
20 which are shown in Fig. 1 as sittrng on the stage 26 mounted to the 
5 robotic assemblv 1 6. 

The depicted robotic assembly 16 follows from principles well 
icnown in the art of electrical engineering and is just one example of a 
robotic assembly suitable for moving a pin assembly to tocations adjacent 
a substrate and source plate such as the depicted substrate 34. 

10 Accordingly, alternative robotic systems can be practiced following the 
descriptions herein without departing from the scope thereof. 

Fig. 1 depicts an embodiment of a robotic assembly 16 that 
includes a pressure controller 28 that connects via a conduit 30 to the 
mount 32 that connects to the pin assembly 38^ In this embodiment the 

15 mount 32 has an Interior channel for fluidic couplir^ of the conduit 30 to 
the pin assembly 38. Accordingly, the pressure controller 28 is fluidtcly 
coupled by the conduit 30 and the mount 32 to the pin assembly 38. In 
this way the cohtroHer 14 can send signals to the pressure controller 28 
to control selectively a flutd pressure delivered to the pin assembly 38. 

20 Fig. 2 depicts one embodiment of a pin assembly 50 suitable for 

practice with the system depicted in Fig, 1 which includes the pressure 
controller 28. In the depicted embodiment the pin assembly 50 includes 
a housing formed from an upper portion 52 and a lower portion 54 that 
are joined together by the crews 56A and 56B to define an interior 

25 chamber volume 58. Ffg. 2 further depicts that to flufdicly seal the 
interior chamber volume 58 the housing can include a seal element 
deleted In Fig. 2 as an O-ring gasket 60 that sites between the upper 
Wocic and the lower block 54 and surrounds completely the perimeter of 
the Interior chamber volume 58, Fig, 2 further depicts that the pin 
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assembly 50 includes a plurality of vesicles 62A-62D, each of wNch 
Include an axial bore extending therethrough to form the depicted holding 
chambers 64A.64D. Each of the depicted vesicles extende through a 
respective aperture 68A-6eD disposed within the lower block 54 of the 
5 housing. 

As further shown in the depicted embodiment each of the vesicles 
62A-620 has an upper flange portion that sits against a seal element 
70A-70D to form a fluid-tight seal between the vesicie and the lower 
block 54 to prevent fluid from passing through the apertures 68A-68D. 

10 To keep the seal tight, the depicted pin assembly 50 further includes a 
set of biasrng elements 74A-74D depicted In Fig. 2 as springs which, in 
the depicted embodiments, are in a compressed state to force the flange 
element of the vesicles 62A-62D against their respective seal elements 
70A-70D. As shown in Rg. 2, the biasing elements 74A-74D extend 

15 between the vesicles and the upper block 52, Each of the sprigs 74A- 
74D can be focedly mounted to a moumfng pad 7GA-76D where the 
spring elements can attach to the upper block 52. The upper block 52 
further includes an aperture 78 depicted In Fig. 2 as a centrally disposed 
aperture that includes a threaded bore for receiving a swagelok 80 that 

20 can be rotatably mounted within the aperture 78. 

As further depicted in Fig, 2, the swagelok 80 attaches by a 
conduit to a valve 82 than can connect the swagelok 80 to e conduit 84 
that can be coupled to a pressure source, or alternatively can couple the 
swagelok 80 to a conduit 86 that provides for ventmg of the interior 

25 chamber 58. A cemrai bore 88 extends through the swagelok 80 and 
couples to the tubing element which further connects to the valve 82 to 
thereby fluidicly and selectively couple the interior chamber volume 58 to 
eKher a pressure source, or a venting outlet. 
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The pin assembly SO described above and depicted In Fig. 2 
disposed above a eubstrate element 90 that includes a plurality of welis 
92 that are etched into the upper surface of the substrate 90. As 
iliustrated by Fig, 2, the pitch of the vesiclee 62A-62D is such that each 
5 vesicle is spaced from the adjacent vesicles by a distance that Is an 
integral multiple of the pitch distance between wells 92 etched into the 
upper surface of the substrate 90. As will be seen from the following 
description, this spacing facifitates the paraJlel dispensing of fluid, such 
that fkjid can be dispensed into a plurality of wells In a single operation. 

10 Each of the vesicles can be made from stainless steel, silica, polymeric 
materiel or any other material suitable for holding flind sample. In one 
example, 16 vesicles are employed In the assembly, which are made of 
hardened beryllium copper, gold plated over nickel plate. They are 43.2 
mm long and the shaft of the vesicle Is graduated to 0.46 mm outer 

15 diameter with a concave tip. Such a pin was chosen since the pointing 
accuracy can be approximately 501 micrometers. Any suitable pin style 
can be employed for the device, including but not limited to flat, star- 
shaped, concave, pointed solid, pointed semi-hollow, angled on one or 
both sides, or other such geometries. 

20 Fig, 3 shows from a side perspective the lower block 54 of the pin 

assembly 50 depicted in Rg. 2. Fig. 3 shows approximate dimensions 
for one pin assembly. As shown, the lower block 54 has a bottom plate 
98 and a surrounding shoulder 100, The bottom plate 98 is 
approximately 3 mm In thicicness and the shoulder 100 is approximately 

25 5 mm in thickness. 

Fig. 4 shows from an overhead perspective the general structure 
and dimensions for one lower block 54 suitable for use with the pin 
assembly for use with the pin assembly 50 shown in Fig, 2. As shown 
in Rg. 4, the low r block 54 includes a four-by-four matrb< of apertures 
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68 to provide 1 6 apertures each suitable for receiving a vesicie. As 
described above with reference to Fig. 2. the spacing between the 
aperture 68 is typically an integral multiple of the difitance between wells 
on a substrate surfece as well as the wells of a source plate. 
5 Accordingly, a pin assembly having the lower block 54 as depicted in 
Rg. 4 can dispense fluid in up to 16 wells simultaneously. Frg, 4 also 
shows general dimensions of one lower block 54 such that each side of 
block 54 is generally 22 mm In length and the pitch between aperture 68 
Is approximately 4.6 mm. Such a pitch is suitable for use with a 

10 substrate whore fluid is to be dispensed at locations approximately 500 
iJtn apart as exemplified by the substrate 90 of Fig. 2. Fig. 4 also 
shows that the lower block 54 can include an optional 0-ring groove 94 
adapted for receivhg an O-ring seal element, such as the seal element 60 
depicted In Rg. 2, It is understood that such a groove element 94 can 

15 enhance and improve the fluid seal formed by the seal element 60. 

The pinblock can be manufactured of stainless steel as this 
material can be drnied accurately to about +25 pm. but a variety of 
probe materials can also be used, such as GIO laminate, PMMA or other 
suitable material. The pin block can contain any number of apertures and 

20 is shown with 1 6 receptacles which hold the 1 6 pins in place. To 

increase the pointing accuracy of each pin, an optional alignment place 
can be placed bek)W the block so that about 6 mm of the pfn tip is left 
exposed to enable dipping Into the wells of a microtlter piate. The layout 
of the probes rn the depicted tool is designed to coordinate with a 384- 

25 well microliter plate, thus the center-to-center spacing of the probes in 
4.5 mm. An array of 4 x 4 probes was chosen since It would produce an 
array that would fit in less than one square inch, which is the travel 
range of an xy stage of a MALDI TOF MS ennproyed by the assignee. 
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The pintool assembly is completed with « stainless steel cover on the top 
«ide o* the device which is then ettacheef onto the Z-arm of the robot. 

With reference to Rgs 1 and 2, the robotic assembly 1 6 employe a 
pin tool assembly 38 that is configured similarly as the pin tool assembly 
5 50 depioted in Figure 2. The pressure controller 28 selectively controls 
the pressure within chamber 58. With this embodvnant, a control 
program operates on the data processor 12 to control the robotic 
assembly 1 6 in a way that the assembly 16 prints an array of elements 
on the substrates 34. In a first step, (see, also figures 2 and 5), the 
10 program directs the robotic assembly 16 to move the pin assembly 38 to 
be disposed above the source plate 20. The robotio assembly 16 wiH 
then dip the pin assembly into the source plate 20 which can be a 384 
well DNA source plate. As shown In Rg, 4 the pin assembly can include 
1 6 different pins such that the pin assembly SO wilt dip 1 6 pins into 
15 different 1 6 wells of the 384 well DNA source plate 20. Next the data 
processor 1 2 wiil direct the motion controller 14 to operete the robotic 
assembly 16 to move the pin assembly to e position above the surface of 
the substrate 34. The substrate 34 can be any substrate suitable for 
receiving a sample of material and can be formed of silicon, plastic, 
20 metal, or any other such suitable material. Optionafly the substrate will 
have a flat surface, but can alternatively include a pitted surface, a 
surface etched with wells or any other suitable surface typography and 
further containing beads on the surface of each well. The program 
operating on data processor 12 can then direct the robotic assembly, 
26 through the motion controHer 14, to direct the pressure controller 28 to 
generate a positive pressure within the interior chamber volume 58. In 
this practice, the positive interior pressure will force fluid from the 
holding chambers of vesicles 62 to eject fluid from the vesicles and into 
a respective well 92 of the substrate 90. 
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The program operating on data processor 1 2 car> atso direct the 
controller 14 to control the pressure controller 28 to control filling the 
holding chambers with source material from the source plate 20. The 
pressure controller 28 can generate a negative pressure within the 
5 interior chamber volume 58 of the pin assembly. This will causa fluid to 
be drawn up into the holding chambers of The vesicles 62A-62D. The 
pressure controller 28 can regulate the pressure either by open-loop or 
closed-foop control to avoid having fluid overdrawn through the holding 
chambers and spilled into the interior chamber volume 58- Loop control 

to systems for controlling pressure are well known in the art and any 
suitable controller can be employed. Such spillage could cause cross- 
contamrnatlon, particularly if the source material drawn from the source 
plate 20 varies from well to well. 

in an alternative embodiment, eech of the holding chambers e4A- 

15 64D is sufficiently small to allow the chambers to be filled by capillary 
action. In such a practice, the pin assembly can include an array of 
narrow bore needles, such as stainless ateel needles, that extend through 
the apertures of the lower block 64. The needles that are dipped into 
source solutions will be filled by capillary action. In one practice, the 

20 length of oapfllary which is to be filled at atmospheric pressure is 
determined approximately by: 

H = 2k 
PGR 

where H equals Height, gamma equate surface tension, P equals solution 
25 density, G equals gravitational force and R equals needle radius. Thus 
the volume of fluid held by each vesicle can be controlled by selectvig 
the dimensions of the interior bore. It is understood that at room 
temperature water will fill a 15 cm length of 100 pm radlua capillary. 
Thus, a short bore nanoliter volume needle will fill to fuJI capacity, but 
30 should not overflow because the capillary force is understood to be too 
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small to form a meniscus at the top of the needle orifice. This prevents 
cross-contamination due to spillage. In one embodiment, the vesicles of 
the pin assembly can be provided with different sized interior chambers 
for holding and dispensing different volumes of fluid. 
5 In an altemBtive practice, to decrease the volume of liquid that is 

drawn into the holding chambers of the vesicles, a small positive 
pressure can be provided within the interior chamber volume SB by the 
pressure controlier 28, The downward force created by the poative 
pressure can be used to counter the upward capillary force. In this way. 

1 D the volunoe of fluid that is drawn by capillary force into the holding 
chambers of the vesicles can be controlled. 

Fig. SB shows that fluid within the holding chambers of the needle 
can be dispensed by a small positive pressure introduced through the 
central bore 8B extending through a awagelok 80. By regulating the 

1 5 pressure pulse that Is introduced Into the interior chamber volume 58, 
fluid can be ejected either as a spray or by droplet formation at the 
needle tip. It Is understood that the rate of dispensing, droplet versus 
spray, depends in part upon the pressure applied by the pressure 
controller 28. in one practice, pressure is applied in the range of 

20 between 10 and 1 ,000 Torr of atmospheric pressure. 

To this end the data processor 1 2 can run a computer program 
that controls and regulates the volume of fluid dispensed. The program 
can direct the controller 28 to eject a defined volume of fluid, either by 
generating a spray or by fonnlng a drop that sits at the end of the 

25 vesicle, and can be contacted with the substrate surface for dispensing 
the fluid thereto. 

Rgures 5C and SD show the earlier steps shown in Figs. 5A-5B 
can again be performed, this time at a positron on the substrata surface 
that is offset from th earlier position. In the depicted process, the pin 
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tool is offset by a distance equal to the distance between two wells 92. 
Other offset printing techniques can be employed. 

Several advantages of the pin assembfy depicted in Fig. 2 are 
achieved. For example, rinsing between dispensing events ts straight* 
5 for ward r requiring only single or multiple pin fillings and emptying events 
with a rinse solution. Moreover, since all holding chambers fill to full 
capacity, the accuracy of the volumes dispensed varies only according to 
needle inner dimensions which can be carefully controNed during pin 
production. Further the device is cost effective, with the greatest 

10 expense attributed tq the needles, however because no contact with a 
surface is required, the needJes are exposed to littie physical strain or 
stress, maicing replacement rare and providrng long life. 

Alternath;ely, deposition of sample material onto solid support 
surface can include techniques that employ pin tool assemblies that have 

15 soird pin elements extending from a block wherein a robotic assembly 
dips the solid pin elements of the pin assembly into a source of sample 
material to wet the distat ends of the pins with the sample materials. 
Subsequentiy the robotic assembly can move the pin assembly to a 
location above the substrate and then lower the pin assembly against the 

20 surface of the substrate to contact the individual wetted pins against the 
surface for spotting material of the substrate surface. 

Pin tool assemblies that have solid pins with a promoter-containing 
nucleic acU immobilized thereon can be dipped into wells containing a 
reaction mixture for hybridiratlon end/or generation of RNA transcript. 

25 The liquid with the transcripts can then be deposited in an array as 

discussed above, for example by using solid pins of a pin tool assembly 
to spot sample on the substrate surface. 

Figures 6A and 68 depict another alternative system for dispensing 
material on or to the surface of the substrate, in particular, Figure 6A 
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depicts a jet priming device ITO which includee a capillary element 1 12, 
a transducer eleniBnt 1 14 and orifice (not shown) 1 1 S, a fluid conduit 
122, and a mount 124 connecting to a roboirc arm assembly, such as 
the robotic arm 24 depicted in Figure T. As further shown in Rgure 6A 
5 the jet assembly 1 1 0 is suitable for ejeetino from the orifice 118a series 
of drops 120 of a sample material for dispensing sample material onto 
the surface 128. 

The oapillarv 1 1 2 of the jet assembly 1 10 can be a glass capillary, 
a plastic caprtlary. or any other suitable housing that can carry a fluid 

70 sample and that will allow the fluid sample to be ejected by the action of 
a transducer element, such as the transducer element 114. The 
transducer element 1 1 4 depicted in Figure 6 A is a plezo electric 
transducer element which forms around the parameter of the capillary 
1 12 and can transform an electrical pulse received from the pulse 

16 generator within a robotic assembly 1 6 to cause fluid to eject from the 
orifice 1 18 of the capillary 1 12, One such Jet assembly having a 
piezoelectric transducer element Is manufactured by MicroFab 
Technology, Inc., of Qemnany. Any Jet assembly, however, that is 
suitable for dispensing defined and controlled the volumes of fluid can be 

20 used herein including those that use piezoelectric transducers, electric 
transducers, electrorestrlctive transducers, magnetorestrtctrve 
transducers, electromechanical transducers, or eny other suitable 
transducer element. Jn the depicted embodiment, the capillary 112 has a 
fluid conduit 1 22 for receh^ing fluid material. In an optional embodiment, 

25 fluid can be drawn into the capillary by action of a vacuum pressure that 
wilf draw fluid through the orifice 118 when the orifice 1 1 8 is submerged 
in a source of fluid material. Other embodiments of the jet assembly 1 10 
can be practiced with the invention without departing from the scope 
thereof. 
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Pigure 6B Htustrates a further aHarnath/e assembly suitabre for p 
being carried on the robotfc arm of a robotic assembly, such as the 
assembly 1 6 depicted in Figure 1 . figure 6B illustrates four jet 
assembnee connected together, 130A-1 30D. Similar to the pin assembly 
5 in Figure 2. the |6t assembly depicted in Figure 6B can be employed for 
the parallel dispensing of ffuid material. Each of the Jet assemblias 130A- 
130D can be operated rndependantly of the others, allowing the saleotive 
disper^eing of fluid from select ones of tha |et assemblies. Moreover, 
each of the jet aesembliee 130A-130D can be independently controHed to 
10 select the volume of fluid that is dispensed from each respected one of 
the assembly 1 30A-130D. Other mcdHlcations and alterations can be 
made to the assembly depicted in Figure 6B whhout departing from the 
scope of the mvention. 

Arrays can be formed on a substrate surface according to any of 
15 the techniques discussed above. The sample arrays are then analyzed by 
mass spectrometry to collect spectra data that is representative of the 
composition of the samples In the array, It is understood that the above 
methods provide processes that allow for rapidly dispensing defintte and 
controlled volumes of analyte material. In particular these processes 
20 allow for dispensing sub to bw nanoliter volumes of fluid. These low 
volume deposition techniques generate sample arrays well suited for 
analysis by mass spectrometry. For example, the low volumes yfeld 
reproducibility of spot characteristics, such as evaporation rates and 
reduced dependence on atmospheric conditions such as ambient 
25 temperature and light. 

Continuing with the example shown in Fig. 5, the arrays can be 
prepared by loading oligonucleotides fO.1-50 ngljj\) of different 
sequences or concentrations into the wells of a 96 well microtiter source 
plate 20; the first well can be reserved for holding a matrix solution. A 
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substrate 34, such as a pitted silicon chfp substrate, can be placed on 
the stage 26 of the robotics assembly 16 and can be aligned manually to 
orient the matrix of wells about a set of reference axes. The control 
program executing on the data processor 12 can receive the coordinates 
5 of the first well of the source plate 20. The robotic arm 24 can dip the 
pin assembly 38 into source plate 20 such that each of the T 6 pins is 
dipped Into one off the wells. Each vesicle can fill by capiUary action so 
that the fuli volume of the hording chamber contains fluid. Optionally, 
the program executing on the data processor 1 2 can direct the pressure 
10 controller to fill the Interior chamber 58 of the pin assembly 38 with a 
positive bias pressure that wIN counteract, in part, the force of the 
capiliary action to limit or reduce the volume of fluid that is drawn Into 
the holding chamber. 

Optionally, the pin assembly 38 can be dipped into the same T8 
1 5 walls of the source plate 20 and spotted on a second target substrate. 
This cycle can be repeated on as many target substrates as desired. 
Next the robotic arm 24 can drp the pin assembly 38 in a washing 
solution, and then dip the pin assembly ir«o 16 different weils of the 
source plate 20. and spot onto the substrate target offset a distance 
20 from the initial set of 16 spots. Again this can be repeated for as many 
target substrates as desired. The entire cycle can be repeated to make a 
2x2 array from each vesicle to produce an 8x8 array of spots {2x2 
elements/vesicle x 1 6 vesicles = 64 total elements spotted). Any 
process suitable for forming arrays can be employed. 
25 ORgonucleotides of different sequences or concentrations can be 

loaded into the weils of up to three different 384.weli microtiter source 
Plates: one set of 16 wells can be reserved for matrix solution. The 
wells of two Plates are filled with washing solution. Rve microtiter 
plates can be loaded onto the stage of the robotic assembly 1 6. A 
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plurarrrv of target substrates can be placed abutting an optfonal sat of 
banking or registration pins disposed on the stage 26 and provided for 
aligning the target substrates along a set of reference axes. If the matrix 
and oligonucfeotJde are not pre-mixedp the pin assembly can be employed 
5 to first spot matrix solution on alf desired target substrates. In a 

subsequent step the oligonucleotide solution can be spotted fn the same 
pattern as the matrix materiel to re-dlssotve the matrix. Alternatively, a 
sample array can ba made by placing the oJigonucreotide solution on the 
wafer first, followed by the matrix solution, or by pre-mbcing the matrix 

10 and oligonucleotide solutions. 

After depositing the sample arrays onto the surface of the 
substrate, the arrays can be analyzed usJng any of a variety of means ' 

spectrometric techniques, such as UV/VIS, fR, fluorescence, 
chemiluminescence, NMR spectrometry or mass spectrometry. For 

15 example, subsequent to either dispensing process, sample loaded 

substrates can be placed onto a MALDI-TOF source plate and held there 
with a set of beveled screw mounted polycarbonate supports. In one 
practice, the plate can be transferred on the end of a probe to be herd 
onto a 1^ resolution, 1 " travel xy stage (Newportl In the source region 

20 of a time-of-flight mass spectrometer. Any suitable macs spectrometry 
tool can be employed. 

Preferred mass spectrometer formats for use whh the arrays 
described herein include Ionization {I) techniques rncluding but not limited 
to matrix assisted leser desorption <MALDI), continuous or pulsed 

2B electroapray (ESI) and related methods (g^ lonspray or Thermospray), or 
massive cluster impact (MCI); those ion sources can be matched with 
detection formats including linear or non-linear reftectron time-of«flight 
{TOF), single or multiple quadruple, single or multipte megnetic sector, 
Fourier Transform ion cyclotron resonance (FTICR), ran trap, and 
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combinatjons thereof {e.g.. ion-traprtime-of-flight). For ionization, 
numerous matrix/wavelength combinations (MALDI) or solvent 
combinations (ESI) can be employed. Subattomola levela of protein have 
been detected for exan^le, using ESI IValaskovIc, Q. A. et al., {1996) 
5 S£!finfifi223: 11 99-1202) or IWALDI (U. L et al., 11986) J. Am. Cham. 
SSfiHg: 1662-1663) masa spectrometry. 

Thus, it will be understood that In processes described herein a 
completely non-contact, high-pressure spray or partial-contact, low 
pressure droplet formation mode can be employed. In the latter, the only 
10 contact that will occur is between the droplet and the walls of the well or 
a hydrophilic flat surface of the substrate 34. In neither practice need 
there be any contact between the needle tfp and the surface. 
Exemplary embodiment 

In one pref^red embodiment, a double-stranded nucleic acid 
15 sequence encoding a promoter sequence is isolated from a natural source 
{&SL. bacteria, viruses, bacteftophages, plants or eukaryotic organisms) 
or assembled from synthetic sequences. A single-stranded region of at 
least a 5 nucleotides at tha 3' -end of the coding strand using standard 
methods tcnown to those of skiU in the art (e.g.. see Sambrook et aL, 
20 {1989) Molecular Cloning, 2nd ed., Cold Spring Harbor Laboratory Press, 
New York). This single-stranded region is designed such that It is 
complementary to a region of the nucleic acid to he sequenced or to an 
sequence shared between the tv/o nuclete acid molecules ( e.g.. g 
restriction endonuclease sitel. 
25 The nucleic acid to be sequenced containing at least a partially 

single-stranded 3'-end is hybridized according to the conditions described 
herein, preferably high stringency such that 5 contiguous matching base 
pairs form, and known to those of skill in the art to the complementary 
sequences of the promoter-containing DNA. The nucleic acid to be 
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sequenced may be double-stranded or preferably single-stranded. The 
hybridization of the two nucleic acid molecules Introduces one or more 
"nicks" fn the hybrid at the junotionis) of the adjacent nuoJeio acid 
molecules. Nicks in the coding or non-coding strand, preferably the 
5 coding strand, can be ligated by the addition of an appropriate nudeic 
acid ligase prior lo initiating transcription. Methods for llgating nucleic 
acids are well known to those of skll] in the art Ib.q., see Sambrook et 
aL, (1989) Molecular Cloning, 2nd ed.. Cold Spring Harbor Laboratory 
PresSr New YorkI and DMA and RNA ligases are commercially available 

10 (e^. Boehringer Mannheim, Indianapolis, IN). 

Transcription is inrtiated from the promoter by the addition of the 
appropriate RNA polymerase in the presence of ribo nucleoside 
triphosphates under conditions described herein and known elsewhere 
ISA. In PNA Polymerase and the Regulation of Tr anscription. Rezfnkoff 

15 et al... eds. Elsevier, NY). In preferred embodiments, a selected base- 
specific chain terminating S'-deoxyribonticleoside triphosphate and the 
transcription rraxture also contains moslne 5'-triphasphate to reduce the 
secondary structure of the RNA product or modified ribonucleoside 
triphosphates, such as 4-thio UTP, S-bromo UTP or 5'-lodo CTP to 

20 facilitate turriover of the RNA polymerase enzyme and thereby increase 
the amount of RNA transcript available for analysis. 

In preferred embodimentSr the nucleic acid promoter-containing 
probe is covalently immobilized on a silica support by functlonalization of 
the support with an amino functionality (e.g., by derivatization of the 

25 support with a reagent such as a-amfnopropyl-triethoxysilane (Aldrich 
Chemical Co., Milwaukee. Wl); see Figure 7). Other funotianalized 
oxysilanes or orthosilicates can be used, and are commercially available 
{e^ from Gelest, Jnc, Tullytown, PA)- For example, 3-mercaptoprapyl- 
triethoxysilane can be used to functionalize a srircon surface with thiol 
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groups. The smino-functronalized silica can then be reacted with a 
heterobtfunctlonal reagent such as N-succinimidyi (4-iodacetvl) 
aminobenzoate (SIAB) (Piefce. Rockford, ID. Other homo- end hetero- 
blfunciional reagents which can be employed are 8va[labia commerclallVr 
5 ejL.< from Pierce. Finally, a nucJeIc acid functionailzed with a thiol group 
iSJLf at the 5'-termlnus) is covalentJy bound to the derivatized silica 
support by reaction of the thiol functionality of the nucleic acid molecule 
with the iodoaeatyl functionality of the support. 

In certain embodiments, the nudeic acid can be reacted with the 

10 cross-linking reagent to form a cross-linker/nucteic acid conjugate, which 
is then reacted with a functionalized support to provide an immobilized 
nudetc acid. AJternatlvdy, the cross-linker can be combined with tfie 
nucleic acid and a iunctionalized solid support in one pot to provide 
substantially simultaneous reaction of the cross-linking reagent with the 

15 nucleic acid and the solid support. In this embodiment, ft will generally 
be necessary to use a heterobifunctional cross-lfnker, i.e., a cross-Jinker 
with two different reacth/e functtonaiities capable of selective reaction 
with each of the nucleic acid and the functionalized solid support. 
Foitowing the methods described herein, spatially-addressable 

20 arrays of nudaio acMs rmmobllized on insokible supports that are suitable 
for sequencing of nucleio acids using RNA polymerase can be prepared. 
For Bxample, the methods can be used to provide arrays of different 
nucleic acids immobilized on pins arranged In an array. In another 
embodiment a photo-cleavable protecting group on the insoluble support 

26 can be selectively cleaved (g^atr by photolithography) to provide portions 
of a surface activated for immoblJization of a nucleic acid. For example, 
a silicon surface, modified by treatmem vAOi 3-mercaptopropyl- 
trrethoxysilane to provide thiol groups, can be blocked with a 
photocleavable protecting group <for examples of photodeavable 
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protocting groups, see, e.g.. PCT Publication WO 92/10092, or McCray 
et al..| 1 989) Ann. Rev. B ionhva. Blophvs. Chem. 18:239-270), and be 
seiectiveiy deblocked by irradiation of selected areas of the surface, sJLr 
by use of a photolithography mask. A nucleic acid promoter-containing 
5 probe modified to contain a thiol-reactive group can then be anached 
directly to the support, or. alternatively, a thiol-reactive cross-liniting 
reagent can be reacted vnth tiie thioi-modified support, followed by (or 
substarttially simultaneouslY wilh} reaction with a nucleic acid to provide 
immobilized nucleic acids. A nucleic acid base or sequence, once 

10 immobilized on a support according to the methods described herein, can 
be further modified according to known methods. For example, the 
nucleic acid molecule can be lengthened by performing solid-phase 
nucleic acid synthesis according to conventional techniques, including 
combinatorial technique 

1 5 Preferably the nucleic acids axe covalentiy bound to a surface of 

the insoluble support through at least one sulfur atom, Lfi« the nucleic 
ackis are covalentiy bound to the surface through a linker moiety which 
includes at least one sulfur atom. Such covalentiy bound nucleic acids 
are readily produced by the methods described herein. In preferred 

20 embodiments, ihe covalentiy bound nucfek: adds are a present on the 
surface of the insoluble support at a density of at least about 20 
fmol/mm\ more preferably at least about 75 fmol/mm^, still more 
preferably at least about fmoi/mml yet more preferably at least about 
100 fmol/mm^ and most preferably at least about 1 60 fmol/mm^. 

25 Applications 

The methods of sequencing nuol^c ackis described herein may be 
used for a variety of end use applications. For example, the methods 
may be used fbr diagnostic applications for the identification of 
mutations, such as transitions, trensverslons, deletions, insertions and 
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the like. The methods may also be used to assist in the diagnosis of a 
number of gsnetic disorders, used for genetic screening or the 
determination of heredity or tissue or organ oompatibiiity. 

The method of sequencing may be used for diagnostic applications 
5 to determined the presence of genetic alterations in a known target 
nudoic acrd. For example, a region of the target nucieiQ acid can be 
amplified using standard methods, such as PGR or other amplification 
methods Icnown to those of skill in the art te.^ .see Sambrook fit aL, 
(19891 Molecular Cloning, 2nd ed., Cold Spring Harbor Laboratory Preas, 

10 New York). The an^nfled nucleic acid can be denaturated and the strand 
to be sequenced (the noncoding strand, Is Isolated or may be used as a 
double-stranded molecule. 

The presence of genetfc alterations in a known target nuciete is 
determined in a diagnostic application by isolating (or amplifying) a target 

15 nucleic acid from a biologioaf sample and hybridizing the target to a 
promoter containing capture probe to generate a promoter containing 
template. A nested set of RNA transcripts is generated and the 
sequence of the target nuclete acid determined and compared with a 
wiWtype sequence to determine If a genetic alteration has occurred in the 

20 target nucleic acid. The presence of a genetic alteration can be 

determined from the RNA transcripts by other means known in the art 
(see, U.S. Patent No. 5,605 J98 and International PCT application 
No. WO 98/200191. 

Methods of Identifying Transcriptional Terminator Sequences 
25 Methods of Identifying heretofore unknown transcriptional 

terminator and attenuator sequences are also provided. Transcriptional 
tenninator sequences are responsible for the sequence specific cessation 
of RNA elongation and such terminator sequences from a variety of 
organisms have been reported. For example, bacterial rho-independent 



SUBSTTTUre SHEET (RULE 26} 



FCT/US92L'267l8 



-66- 

terminator sequences have two inverted repeats separate by several base 
pairs foiJowed by a 5-10 nt polyT stretch. Upon transcription through 
these sequences r the resulting mRNA transcript forms a RNA hafrpin 
stem-loop secondary structure behind the RNA polymerase nnolecule that 
5 increases pausing of RIMA polymerase and/or destabtlrzes the RNA 
polymerase-DNA template interaction resulting rn termination of the 
transcript within the DMA pofyT stretch, (e^, see Wilson and von Hippel 
(19; Reynolds et al. (1992a) JJ^oL^BioL 224:53-63; Reynolds etaL 
(1992b) J, MoL BloL 224!3US1i Telesnftsl<y et aL H 989) Biochemistry 
10 28:5210-5218; d'Aubenton Carafa ej aL (1990) J- IVIqJ. Biol. 216 :835- 
858). 

Bacterial rho-dependent terminators lack the traditional Inverted 
repeat stem-loop structure end further require additional factors, such as 
rho protein, to halt transcription te,g.. see Schmidt si fiU |1 984) J. Mol^ 

T5 BioL 259: 1 5QO0-1 5002). Rho-dependent termination is believed to result 
in the premature termination in bacterial species upon uncoupling of 
trenscription and translation » 

By modifying the standard transcription conditions described 
herein, transcriptional terminator sequences, e.g.. rho-dependent and rho- 

20 independent temninators, may be identified using mass spectrometric 
methods. In practicing the methods, a single-stranded region of the 3'- 
end of the nucleic acid to be sequenced is hybridized to a complementary 
sequence at the 3'-end of -die coding strand a promoter-containing 
nucleic acid probe. In preferred embodiments, the promoter-containing 

25 nucleic acid is covalently coupted via the 5' -end of the noncoding strand 
or 3'-end of the coding strand to a solid support and, more preferably, is 
a 5'- or S'-thiolated DNA linked at high densities to a aminosilane-treated 
solid support. 'The iinlcage may be In the absence or presence of a linker 
group and Is preferably arranged in an array format. 
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Transcription is initiated in the absence or presence of modified 
RNA triphosphate analogs that increase ths efficiency of RMA polymerase 
termfnation at such terminator sequences, such as 4-thio UTP, 5-bromo 
UTP or 6'-3odo CTP. The mass of the speciflcafly terminated RNA 
5 transcripts can be detected by mass spectrometry where the observed 
mass of the RNA is indicative of the Eocation of the terminator-dependent 
arrest of transcription. By comparing of the alignment of the sequence 
immediately preceding the site of transcriptional termrnation from several 
distinct genomic bcations, heretofore unknown term'cnator sequences 

10 may be identified for different RNA polymerases. 

In certain embodiments, nfolcs In one or more strand resulting from 
the hybridization of the nucleic acid to be sequenced may be ligated by 
the addition of an appropriate nucleic acid ligase prior to initiating 
transcription (Lg., adding a DNA or RNA ligase). 

15 The present invention is further iHustrated by the fallowing 

Examples, which area intended merely to further niustrate and should not 
be construed as limiting. The entire ccntenta of all the of the references 
(including literature references, Issued patents, published patent 
applications, and co-pending patent appJications) cited throughout this 

20 applicairon are hereby expressly incorporated by reference. 

EXAMPLE 1 

High Density Attachment of nucleic Acids 1o Silicon Wafers 
Materials and Methods 

AH reagents, unless otherwise noted, were obtained from Aldrich 
25 ChemicaJ^ Milwaukee, Wl. 
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oligodeoxy nucleotide upon cleavage of the disulfide. Various reduction 
methods were inve&tigeted to determine the optimal conditions* In one 
case, the disulfrde-conteining oligodeoxynucleotide (31.5 nmoi, 0.5 mM) 
was incubated with dithiothreitci (DTT) (Pierce ChemicaL Rockford, ID 
5 (6.2 mmol 100 mM) as pH 8.0 and 37 ""C. With the cleavage reaction 
essentiatiy complete, the free th1ol*containing oligodeoxynucleotide was 
isolated using a Chromaspin-IO column (Clontech, Palo Alto, CA) since 
DTT may compete in the subsequent reaction, Ahernatively, tri5-(2- 
carboxyethyl) phosphine (TCEP) {Pierce Chemicals Rockford, ID has been 

10 used to cJeave the disulfide. The disulfide-containing 

oligodeoxynucEeotide (7.2 nmol, 0.36 mM> was incubated with TCEP In 
pH 4.5 buffer at 37 °C. It is not necessary to isolate the product 
foUowing the reaction since TCEP does not competitively react with the 
iodoacetamido functionality. Varying concentrations of TCEP were used 

15 for the cleavage reaction to determine the optimal conditions tor the 
conjugation reaction. 
Probe Coupling 

To each wafer which had been derjvatized to contain tiie 
Iodoacetamido functionality as described above was added a 10 mM 

20 aqueous solution of the free-thiol containing oligodeoxynucleotide In 100 
mM phosphate buffer, pH 8; the reaction was allowed to proceed for a 
minimum of five hours at room temperature in 100% relative humidity. 
Following the reaction, the oligodeoxynucleotide solution was removed, 
and the wafers were washed two times in 5 X SSC buffer (75 mM 

25 sodfum cHrete, 750 mM sodium chloride, pH 7) with 50% formamrde 
{USB, Cleveland, OH) at 65^0 for 1 hour each. 
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Radlochemicai Determinaiion of Prabe Density 
In order to determine the amount of DNA covalently attached to a 
surface or the amount of a complementary sequence hybridized, 
rediolabeJed probes were employed. In cases where a 6 '•disulfide- 
5 containing oligodeaxynucleotide was to be immobilized, the S'-terminue 
wa« radiolabeled using terminal transferase enzyme and a radiolabeled 
dideoxynuoleoside triphosphate; in a standard reaction, 15 pmol (O.^M) 
of the 5'-disulfide-containing oligodeoxynucleotide was incubated with 
50/iCi (16.5 pmol. 0.66 pM) of (tf-"P| dldeoxyadenoslne-S'triphosphate 

10 IddATP) (Amersham. Arfington Height, ILI in the presence of 0.2 mM 2- 
mercaptoethanol. Upon the addition of 40 units of the terminal 
deoxynudeotidyl transferase enzyme (USB, Cleveland. OH), the reaction 
was allowed to proceed for one hour at 37»C. After this time, the 
reaction was stopped by immersion of the vial in 75'C water bath for ten 

1 5 minutes, and the product was isolated using a Chromaspln-1 0 column 
(Clontech, Palo Alto, CA). Similarly, a S'-dlsuffide-containlng 
oligodeoxynucleotide was radiolabeled with '^S. 

In cases where a 3'-disuJfide-contalning oligodeoxynucleotide was 
to be Immobilized, the 5'-terminus was radiolabeled using T4 

20 polynucleotide kinase end a radiolabeled nucleoside triphosphate. For 
example, 15 pmol \OSuM) of the 3'-dlsulfide-oontaintng 
oligodeoxynucleotide was incubated with 50/aCi (16.5pmal, 0.66)uM) of 
adeno8lne-5'trlphosphate <ATP| lAmersham. Arlington Height, ID 
In the presence of SO mM Tris-Ha, pH 7.6. 1 0 mM MgCl^, 10 mM 2- 

25 mercaptoethanol. Following the addition of 40 units of T4 polynuclBotide 
khase, the reaction was allowed to proceed for 1 hour at 37°C. The 
reaction was stopped by immersion of the vial in a 75<'C water bath for 
ten minutes; the product was then isolated using a Chromaspin-IO 
column {Ciontech. Palo Alto. CAi. 
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To determine the density of covalently immobilized probe, the 
disulfide-containing oligodeoxynucleotide of choice was added to a trace 
amount of the same species than had been radiolabeled as dascribad 
above. The disulfide was cleaved , the probe was immobilized on 
5 iodoacetamido-functionalized wafers, the wafers were washed, and then 
exposed to a phosphorimager screen {Molecular Dynamics, Sunnyvaler 
CA). For each different oligodeoxynucJeotide uaed, reference spots were 
made on polystyrene in which the molar amount of oligodeoxynucleotide 
was known; these reference spots were exposed to the phosphor imager 
10 screen as well. Upon scanning the screen, the quantity {In moles) of 
oligodeoxynucleotide bound to each chip was determined by comparing 
the counts to the specrfic acfivities of the references. 
Hybridization and Efficiency 

To a wafer that had been functionalized with an immobilized probe 
15 was added a solution of a complementary sequence HO pM) fn 1 M NaCI 
and TE buffer. The wafer and solution were heated lo 75*»C and allowed 
to cool to room temperature over 3 hours. After this time, the solution 
was removed, and the wafer was washed two times with TE buffer. 
To determine the amount of ollgonuoieotide hybridized, 
20 immobilization of the probe was first carried out as described above 
except that the probe was labeled with rather than ''P. The density 
of immobilized probe was determined with the phosphorimager. Next, 
the same wafer was incubated in TE buffer, 1 M NaCI, and Its 
complementary strand OOfM] which had been radiolabeled with *^P. 
26 Hybridization was carried out as previously described. FoHowing a wash 
to remove non-specific brnding, the wafer and reference were exposed to 
a phosphorimager screen with a piece of copper foH between the screen 
and the wafer: The copper foil serves to block the signal from ^S, while 
allov/ing the "P signal to pass freely. The molar amount of hybridized 
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oligonucleotide is then determined^ tlius revealing the percent of 
covalently immobilized probe that is avelloble for hybridization. 
MALOl-TOF Mass Spectrometrte Analysis 
As described above, wafers containing non-radlolabeled 
5 immobilized oiigodeoxynucleotide (name: TCUC; sequencer 

GAATTCGAGCTCGGTACCCGG; molecuiar weight; 6455 Da; SEQ ID 
NO. 1 ) were synthesized, and « complementary sequence (name: 
MJM6; sequence: CCGGGTACCGAGCTCGAATTC; moieeular weight: 
6415 Da; SEQ ID NO. 2| was hybridized. The wafers wera washed in 

10 50nrii\^ ammonium citrate buffer for cation exchange to remove sodium 
and potassium ions on the DMA backbone (Pieies, U. et al., {1 993) Nucl. 
Acids Res . 21:3191-31 96>. A matrix solution of 3-hydroxypicolinic acid 
13-HPA, 0.7 M in 50% aoetonitrile, 10% ammonium citrate; Wu, K.J., et 
af. (1993^ Raofd Commun. Mass Spaetrnm. 7-lil9-iAftt was spotted 

1 5 onto tlie wafer and allowed to dry at ambient temperature. The wafers 
were attached directly to the sample probe of a FInnlgan iVIAT (Bremen, 
Gemunyl Vision 2000 refiectron TOP mass spectrometer using a 
conducting tape. The refiectron possesses a 5 keV ion source and 20 
kaV po$t*aeeelarat'ron; a nitrogen laser was employed; and aii spectra 

20 were taken in the positive ion mode. 
Surface Chemistry 

Employing standard silicon dioxide modification chemistry, a silicon 
wafer was reacted with S-amlnopropvltrtethoxysaane to produce a 
uniform layer of primary amino groups on the surface. As shown in 
25 Figure 7. the surface was then exposed to a heterobifunctional 
crosslinker resulting in rodoacetamido groups on the surface. It was 
possible to determine the optimal reaction time of this reaction In solution 
using TLC. The SIAB crosslinker wss visualized under long wave 
ulirevi<riBt light 1302 nm) to reveal a spot with an R, value of 0.58. 3- 
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amlnopropyltriathoxysilane was nor active under ultraviolet light, 
therefore, nirihydrin was used to reveal a purple spot indicating the 
presence of a primary amirte at the ba$elir>e. A rrwcroscafe reaction was 
run using a slight moJar excess of SIAB \n comparison to 3- 
5 aminopropyJtrlethoxysilane; TLC analysis after approximately one minute 
revealed a new spot visible under long wave ultraviolet light with ar> Rf 
value of 0.28. There was no evidence of a purple spot upon spraying 
with ninhydrin, thus ail the 3-aminopropyltriethoxysifane starting material 
had been consumed in the reaction. UV light also revealed the excess 

10 SIAB which remained following the reaction. From these results, It was 
determined the reaction Is complete after approximately one minute. In 
all cases, the iodoacatamldo-functionalized wafers were used 
rmmediateh^ to minimize hydrolysis of the iaUle lodoacetamido- 
functionality. Additionally, all further wafer manipulations were 

IS performed in the darit since the iodoacetamido-functionality Is light 
sensitive. 

Disulfide reduction of the modified oligonucleotide was monitored 
by observing a shift in retention lime on reverse-phase FPLC. It was 
determined that after five hours In the presence of DTT (100 mlVI) or 

20 TCEP (10 n\M), the disulfide was fully reduced to a free thiol. If the DTT 
reaction was allowed to proceed for a longer time, an ollgonucleDtide 
dimar formed in which pairs of free thiols had reacted. Such dimerization 
was also observed when the DTT was removed following the completion 
of the cleavage reaction. This dimerlzation was not observed when 

25 TCEP was employed as the cleavage reagent since this reaction is 

performed at pH 4.5, thus tlie free thiols were fully protonated inhibiting 
dimerization. 

Immediately foElowing disulfide cleavage, the modified 
oirgonucleotide was incubated with the iodacetamrdo-functionalized 
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wafers. To ensure complete thiol deprotonation, the coupling reaction 
was performed at pH 8,0. The probe surface den&ity achieved by this 
chemistry of silicon wafers was analysed using radiolabeled probes end a 
phosphorimager. The probe surface density was also monitored as a 
5 function of the TCEP concentration used In the disulfide cleavage 
reaction {Rgure 8), Using 10 mM TCEP to cleave the disuffide and the 
other reaction conditions described above, it was possible to reproducibly 
yield a surface density of 250 fmol per square mm of surface. Identical 
experiments as described above were performed except that the 
10 oligonucleotide probe lacked a thiol modifioatron; surface densities of less 
than S fmol per square mm of surface proved that non-specific binding is 
minima] and that probe coupling most likely occurred as proposed fn 
Figure 7. 

Hybridization 

1 5 After anaching ^^S-labeled probes to the surface of wafers and 

determining conjugation density as described above, hybridization of ^^P- 
labeled oligonucleotides was carried out" hybrUization efficiency and 
density were determined usmg the phosphorimager and copper foil. It 
was determined experimentally that copper foil blocks 98,4% of an 

2D signal, while fully allowing e '^P signai to be detected. The 

complementary sequence reproducibly hybridized to yiaJd 105 fmol per 
square mm of surface; this corresponds to approximately 40% of the 
conjugated probes available for hybridization. Similarly, a non- 
complementary sequence was employed in thfs scheme yielding less than 

25 5 fmol per square mm of surface in non-specific binding. 

It is hypothesized that stearic interference between the tightiy 
packed oligonucleotide on the flat surface inhibits hybridization 
efficiencies higher that 40%. With this in mind, a spacer molecule was 
incorporated between the terminus of the hybridizing region of the 
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oNgonucleotide and the support. The chosen spacers were a series of 
poly dT sequences ranging in Jength from 3 to 25. Upon examination of 
these samples with radiolabels and the phosphorinriager, it was 
detemnined that 40% was still the maximum hybridization that could be 
5 achieved. 

MALDI-TOF MS Analysis 

Wafers were funotioriaHzed with probes, complememary 
sequences were hybridized, end the san^les were aiuiyzed under 
standard MALDJ conditions as described above. Analysis revealed that 

10 only the anneaied strand {MJM6) was observed In the mass spectrum 
with an experimental mass-to-charge ratio of 841 5.4; the theoretical 
mass-to-charge ratio is 64 T 5 {Rgure 9). Since there was no signal at a 
mass-to-charge ratio of 6465, it was determined that ths wafer- 
conjugated strand {TCUC} was not desorbed thus the lodoacetamldo 

15 linkage was stable enough to withstand the laser and remain intact. 
There was an additional signal observed at a mass-to-charge ration of 
6262.0. This signal results from a depurination of guanosines since it Is 
known that DNA is susceptible to the loss of purine bases during the 
MALDI process, Wordoff, E., §£aL M992) Rapid Commun. Mass 

20 gpe<;:fpm. Jg:771 -7761. The sample crystals on the wafer were not 
homogeneously distributed, thus It was necessary to hunt for a good 
spot , Because of this non-homogeneity, the mass resolution varied, but 
it generally ranged from 200-300 for the desorbed oligonucleotide in the 
ntass spectra. In one set of experiments, non-complementary sequences 

25 were hybridized to the wafer; following a wash as previously described, 
analysis by MALDI-TOF MS revealed that minimal non-speaific annealing 
had taken place since no signal was detected. 
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EXAMPLE 2 

Preparaton of DNA array* udng serial and parallel diapenaing tools 

Robot-driven serial and parallel pL-nL dlspenaing tools were used 
to generate 10-10' elament DNA arrays on < 1 " square chips with flat or 
5 geometrfcally altered (e.g. with welJsJ surfaces for matrix assisted laser 
desorptlon Ionization mass spectrometry analysis. In the former, a 
•piezoelectric pipette' (70 urn id capillary) dispenses single or multiple 
- 0,2 nL droplets of matrix, and tfiert analyte, onto the ohip; spectra 
from as low as 0.2 fmol of a 36-mer DNA have been acquired using tills 

to procedure. Despite the fast |< 5 sec) evaporation, micro-crystals of 3- 
iiydroxyplcolinio acid matrix containing the analyte ere routinely produced 
resulting in higher reproduclbiHty than routinely obtained with larger 
volume preparations; all of 100 five fmol spots of a 23-mer in 800 jim 
wells yielded easily interpreted mass spectra, with 99/100 parent ion 

tS signals having signal to noise ratio of >5. in a second approach, probes 
from 384 well microtiter plate are dispensed 16 at a time into chip wells 
or onto flat surfaces using an array of spring loaded pins which transfer 
-20 nL to the chip by surface contact; MS analysis of array elements 
deposited with the parallel method are comparable in terms of sensitivity 

20 and resolurion to those made with the serial method. 

Description of the plewelectric seriaf dispenser 
The experimental system developed from a system purchased from 
Microdrop GmbH, Norderstedt Germany and can include a piezoelectric 
element driver which sends a pulsed signal to a piezoelectric element 

2B bonded to and surrounding a glass capillary which holds the solution to 
be dispensed; a pressure transducer to load (by negative pressure) or 
empty |by positive pressure! the caplUary; a robotic xyz stage and robot 
driver to maneuver tiie capillary for loading, unloading, disp nsing, and 
cleaning, a stroboscope and driver pulsed at the frequency of the pfezo 
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element to enable viewing of 'suspended' droplet charaoteristies; 
separate stages for source and designation plates or sample targets (i.e. 
SI chip): a camera mounted to the robotic arm to view loading to 
designation plate; aid a data station wliJch controls the pressure unit, 
5 xyz robot, and piezoelectric driver. 

Description of the parallel dispenser 

The robotic pintool includes 1 8 probes housed in a probe block 
and mounted on an X Y, Z robotic stage. The robotic stage was a gantry 
system which enables the placement of sample trays below the arms of 

10 the robot. The gantry unit itself Is composed of X and Y arms which 
move 250 and 400 mm, respectively, guided by brushless linear servo 
motors with positional feedback provided by linear opticai encoders. A 
lead screw driven 2 axis (50 mm vertical travel) is mounted to the xy axis 
slide of the gamry unit and Is controlled by an in-line rotary servo motor 

IS with positional feedback by e motor-mounted rotary optical encoder. The 
work area of the system Is equipped with a slide-out tooling plate that 
holds five microtiter plates (most often. 2 plates of wash solution and 3 
plates of sample for a maximum of 1 152 different oligonucleotide 
solutions) and up to ten 20x20 mm wafers. The wafers are placed 

20 precisely in the plate egainst two banking pins and held secure by 

vacuum. The entire system is enclosed in plexl-glass housing for safety 
and mounted onto a steel support frame for thermal and vibratronal 
damping. Motion control is accomplished by employing a commercial 
motion controller which was a 3-axi8 servo cortroller and is integrated to 

25 a computer; programming code for specific applications is written as 
needed. 

Samples were dispensed with the serial system onto several 
surfeces which served as targets in the MALDI TOF analysis including (1) 
A flat stainless steel sample target as supplied for routine use In a 
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Therma Bjoanalysis Vision 2000; (2) the same desrgn stainless steel 
target with micromachined nanopits; (3) flat siiicon {SD wafers; (4) 
polished flat Si wafers; (5) Si wafers with rough (3-6 pLm features) pits; 
(6)(a) 12x12 or ((b) 18x18) mm Si chips with (a) 10x10 tor (b) 18x16) 
5 arrays of chemicafly etched wdls, each 800x800 llm on a side with 
depths rar^ging from 99-400 (ortbl 1 20) micrometer, pitch (a) t .0 (or(b) 
1.125) mm; (7) 15x15 mm SI chips with 28x28 arrays of chemically 
etched weMs, each 450x450 micrometer on a side with depths rar)ging 
from 48-300 micrometer, pitch 0.5 mm; (8)flat polycarbonate or other 

10 plastics; (9) gold and other metals; (10) membranes; (11) plastic surfaces 
sputtered with gold or other conducting materials. The dispensed 
volume is controlled from lO '** to 10* L by adjusting the number of 
droplets dispensed. 

Sample Preparation and Dispensing 

15 1. Serial 

Oligonucleotides (0.1-50 ng/microliter of different sequence or 
ooncentrations were loaded into wells of a 96 well microtiter plate; the 
first well was reserved for matrix solution. A pitted chip (target 6a in 
MALDI targets' section) was placed on the stage end aligned manually. 

20 Into the (Windows-based) robot control software were entered the 
coordinates of the first well, the array si2e (ie number of spots In x and 
y) and spacing between elements, and the number of 0.2 nL drops per 
array element. The capillary was filled with ~ 10 /rf. rinse H^O. 
automaticaliy moved in view of a strobe light-llluntinated camera for 

25 checking tip integrity and cleanliness while in continuous pulse mode, 
and emptied. The oapillary was then filled with matrb< solution, again 
checked at the stroboscope, and then used to spot an array onto flat or 
pitted surfaces. For reproducibilty studies In different MS modes, 
typicaUy a 1 0x10 array of 0.2-20 nL droplets were dispone d. The 
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capilJary was emptied by application of positive pressure, optionally 
rinsed with HzO, and led to the source oligo pJate where SpL of 0.05- 
Z.Q/OA synthetic oligo were drawr>. The cepUlary was then restered ir> 
series over each of the matrix spots with 0.2-20 nL aqueous solution 
5 added to each. 

2. Parallel 

Parallel Programs were wrftten to control array making by offset 
printing? to make en array of 64 elements on 10 wafers, for example, the 
tool was dipped into 16 welfs of a 3 84 weli DNA source plate, moved to 

10 the target (s^ Sf, plastic, metal), and the sample spotted by surface 
oontect. The tool was then dipped into the same 16 walls and spotted 
on the second target; this cycle was repealed on alJ ten wafers. Next 
the tool was dipped in washing solution, then dipped Into 16 different 
wells of the source plate, and spotted onto the target 2.25 mm offset 

15 from the in'rtiaJ set of 16 spots; agatn this was repeated on all 10 wafers; 
the entire cycle was repeated to make a 2x2 array from each pin to 
produce an 8x8 array of spots (2x2 elements/pin X 16 pfns « 64 total 
elements spotted). 

To make arrays for MS analysis, oligonucleotides of different 

20 sequences or concentrations were loaded into the wells of up to three 
different 384-weH microtiter plates, one set of 16 wells was reserved for 
matrix solution. The welJs of two plates were filled \mth washing 
solution. The five microtiter plates were loaded onto the sJide-out tooling 
plate. Ten wafers were placed abutting the banking pins on the tooling 

25 plate, and the vacuum turned on. In cases where matrix and 

oligonucleotide were not pre-mfxed, the pintool was used to spot matrix 
solution first on all desired array elements of the ten wafers. For this 
example, a 16 x 16 array was created, thus the tool must spot each of 
die isn wafers 16 times, with an offset of 1 .125 mm. Next, the 
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oligonucleotide soJution was spotted in the same pattern to re-dissoive 
the inatrix. Similarly, an array could be made by placing the 
Qlrgonucleotide solution on the wafer first fofiowed by the rnatrix 
solution, or by pre-mbdng the matrix and oHgonucleotrde solutions, 
5 Mass spectrcmstry 

Subsequent to either dispensing schemer loaded chfps were hefd 
onto a MALDI-TOF source plate with a set of beveled screw mounted 
polyoarbonated supports. The plate was transferred on the end of a 
probe to be held onto a 1 //m resolution, 1 " travel xy stage (NewportI fn 
10 The source region of a time-of -flight mass spectrometer. The Instrument, 
normaliy operated wJth 18-26 kV extraction, could be operated In linear 
or curved field reflectron mode, and rn continuous or delayed extraction 
mode. 

Serial dispensing with the piezoelectric pipette 

15 While delivery of a saturated 3HPA solution can result in tip 

clogging as die solvent at the oapinary-alr Interface evaporates, pre- 
mixing DNA and matrix sufficiently dilutes the matrb< such that K remains 
in solution whtle stable sprays which could be maintained until the 
capillary was emptied were obtained; with 1:1 diluted (in H2O) matrix 

20 solution, continuous spraying for > > 10 minutes was possible. Turning 
off the piezo element so that the capillary sat Inactive for >5 minutes, 
and reactivating the piezo element also did not result in a cicgged 
capfltary. 

Initial experiments using stainless stee[ sample targets as provided 
25 by Finnlgan Vision 2000 MALDUTOF system run in reflectron mode used 
a pre*mb{ed solution of the matrix and DNA prior to dispensing onto the 
sample target. In a single microtlter well, bOfiL saturated matrix solution, 
25pL of a 51;/L sotutlon of Ao 12-mer (ATCGJ3 {SBQ fD No, 3), and 
25/iL of a Bl/jil solution of the 28-mer {ATCG)7 fSEQ ID No. 4t v;ere 
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mixed. A set of 10x10 arrays of 0.^ drops was dispensed directly 
onto a Rnnlgan Vision 2000 sample target disl^; MALDI-TOF mass 
spectrum was obtained irom a single array demant which contained 750 
Bttomoles of each of tha two oligonucleotides. Interpretable mass spectra 
6 has been obtained for DNA molecules as large aa a 53-mer (350 amof 
loaded, not shown) using this method. 

Mass spectra were also obtahed front DNA molecules 
microdispensed into the wells of a silicon chqi. Figure 10 shows a 
12x12 mm silicon chip with 100 chemically etched welJs; mask 

10 dimensions and etch time were set such that fustum (i.e.. Inverted flat 
top pyramidai) geometry wells witfi SOOxSOQium (top surface) and 100;/m 
depth were obtained. Optionally, the wells can be roughed or pitted. As 
described above, the chip edge was aligned against a raised surface on 
the stage to define the x and y coordinate systems with respect to the 

15 capiflary. (Alternatives Include optical alignment, artificial Intelligence 
pattern recognition routines, and dowel-pin based manual afignment). 
Into each well was dispensed 20 droplets (~5 nL) of 3-HPA matrix 
solution without analyte; for the 50% CH^CN solution employed, 
evaporation times for each droplet were on the order of 5-10 seconds. 

20 Upon solvent evaporation, each microdispensed matrix droplet as viewed 
under a 120X stereomicroscope generally appeared as an amorphous and 
'milky' flat disk; such appearances are consistent with those of droplets 
from which the Figure 3b specirum was obtained. Upon tip emptying, 
rinsing, and refilling with a 1 .4//m aqueous solution of a 23-mer DMA 

25 (M,(calcJ = 6967 Da), the capillary was directed above each of the 1 00 
spots of matrix where 5 nL of the aqueous DNA solution was di$j>ensed 
directly on top of the matrix droplets. Employing visualization via a CCD 
camera, it appeared that the aqueous analyte solution mixed with and re- 
dissolved the matrix (complete evaporation took - 10 sec at ambient 
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temperanire and humidity}. The amorphous matrix surfaces were 
converted to true micro-crystalline surfaces, with crystalline features on 
the order of < 1 pm. 

Consistent with the improved crystalRzation afforded by the matrix 
5 re-dlssolvIng method, mass spectrum acquisition appeared more 

reproducible than with pre-mixed matrix plus analyte solutions; each of 
the 100 five fmol spots of the 23-mer yielded interpreted mass spectra 
(Figure 11}, with 90/100 parent ion signals having signal to noise ratios 
of >5; such reproducibility was also obtained with the flat silicon and 

10 mstayic surfaces tried (not shown). The Figure 1 1 spectra were obtained 
on a linear TOF Instalment operated at 26 kV. Upon intemai calibration 
of the top left spectrum (well 'k1') using the singly and doubly charged 
molecular tons, and application of this calibration file to all other 99 
spsctra as an external calibration (Figure 12), a standard deviation of <9 

15 Da from the average molecular weight was obtained, corresponding to a 
relative standard deviation of -0.1%. 

Parallel dispensing witii the robotio pintooi 
Arrays were made with offset printing as described above. The 
velocity of the X and Y stages are 35 Inches/seo, and the velocitv of the 

20 Z stage is 5.5 inchesysec. It is possible to move the X and Y stages at 
maximum velocity to decrease the cycle times, however the speed of tha 
Z stage Is to be decreased prior to surface contact with the wafer to 
avoid damaging it. At such axes speeds, the approximate cycle time to 
spot 16 elements (one tool impression of the same solutions) on all ten 

25 wafers is 20 seconds, so to make an array of 256 elements would take 
-5.3 minutes. When placing diff^ent oligonucleotide solutions on the 
array, an additional washing step much be incorporated to clean the pin 
tip prior to dipping in another solution, thus the cycle time would 
increase to 25 seconds or 8.7 minutes to make 10 wafers. 
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Sample delivery by thd tool was examined using radio-labeled 
solutions and the phosphorimager as described previously; it was 
detemnlned that each pin delivers approximately T nL of liquid. The spot- 
to-spot reproducibrltty is high. An array of 256 oligonucleotide elements 
5 of var^ng sequence and concentration was made on flat silicon wafers 
using the pintool, and the wafer was analyzed by MALDI-TOF MS, 

EXAMPLE 3 

Use of High Density Nucleic Acid Immobilization to Generate Nucleic Acid 
Arrays 

10 Employing the high density attachment procedure described in 

EXAMPLE 1 , an array of DNA oligomers amenable to MALDI-TOF mass 
spectrometry analysis was created on a silicon wafer having a pJurality of 
Jocflttons, e>q., depressions or patches, on Its surface. To generate the 
array, a free thlol-contalnfng oligonucleotide primer was immobilized only 

15 at the selscted locations of the wafer <e,fl.. ses Figure 13). Each 
location of the array contained one of three different oligomers. To 
demonstrate that the different Immobilized oligomers could be separately 
detected and distinguished, three distinct oligonucleotides of differing 
lengths that are complementary to one of the three oligomers were. 

20 hybridized to the array on the wafer and analyzed by MALDI-TOF mass 
spectrometry. 

Oiigodeoxynucteotides 

Three sets of complementary oligodeoxynucieotide pairs were 
synthesized in which one member of the complementary oligonucleotide 
25 pair contains a 3'- or 5'-drsulflde linkage (purchased from Operon 
Technologies or Ollgos, Etc.}, For example, Oligomer 1 
(d<CTQATQCQTCGGATCATCTTTTTT-SS); SEQ ID NO: 5) contains a 3'- 
disuffide linkage whereas Oligomer 2 {dISS* 

CCTCTTGGGAACTGTQTAGTATT): a B'-dlsulffde derlVHtfve of SEQ JD 
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NO; 6) and Oligomer 3 (d(SS-GAATTCGAGCTCGGTACCCQQ); a 5 - 
disulfide derivative of SEQ ID NO: 1) each contain d 6'-disulfide linkage. 

The oligonucleotides complementery to OJigomers 1 -3 were 
designed to be of different lengths that are easily resolvable from one 
5 another during MALDI-TOF MS analysis. For example, a 23-mer 
oligonucleotide (SEQ ID NO: 7) vt^as synthesized complementary to a 
portion of Oligomer 1, a 12-mer oligonucleotide (SEQ ID NO: 8) waa 
synthesized complementary to a portion of Oligomer 2 and a 21-mer 
(SEQ ID NO: 2; sequence denoted '^MJMe" in EXAMPLE 1) was 

10 synthesized complementary to a portion of Olrgomer 3. In addition ^ a 
fourth 2g^mer oligonucleotide (SEQ ID NO: 9) was synthesized that lacks 
complementarttv to any of the three oligomers. This fourth 
oligonucleotide was used as a negative control. 

SHicon surface chemistry and DNA immobilizatfan 

15 (a) 4x4 (16-location) array 

A 2 X 2 cm^ silicon wafer having 266 indhfidual depressions or 
wells in the form of a 1 6 X 16 well array was purchased from a 
commercial supplier ^Accelerator Technology Corp., College Station, 
Texas L The wells were 800 X 800>cm^^ 120^m deep, on a 1.125 

20 pitch. TTie silicon wafer was reacted with 3-amlnopropyltrlethoxyailane 
to produce a uniform layer of primary amines on the surface and then 
exposed to the heterobifunctfonal crosslinker SIAB resulting In 
iodoacetamido functionalities on the surface (e_>q, . see Figure 7). 

To prepare the oligomers for coupling to the various locations of 

25 the ^licon array, the disulfide bond of each oligomer was fuJIy reduced 
using 10 mM TCEP as depicted in EXAMPLE 1, and the DNA 
resuspended at a final concentration of 10 in a solution of 100 mM 
phosphate buffer, pH 8.0, immediately foHowrng disulfide bond 
reduction, the f ree-thiol group of the oligomer was coupled to the 
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iodoacetamldo functionality at 16 locations on the wafer using the probe 
couplir>g conditions essentially ss described in Figure 7. To accomplish 
the separate coupling at 1 6 distinct locations of the wafer^ the entire 
surface of tiie wafer was not flushed with an oligonucleotide solution 
6 but, instead, an -30-ni aliquot of a predetermined modifled oligomer 
was added in paraJiei to each of 1 6 Jocetions (i.e.. depressfons) of the 
256 wells on the wefer to create a 4 x 4 array of immobilized ONA using 
e pin tooJ as described herein (see fi^, the Detailed Description and 
Example 4 provided hereml. 

10 Thus, as shown in Figure 13, one of modified Oligomers 1-3 was 

covalently immobilized to each of 16 separate woJte of the 258 weils on 
the silicon wafer thereby creating a 4 x 4 array of immobilized DNA. For 
example. Oligomer 1 was conjugated at a well positron in the upper left 
hand comer of the 4 x 4 err ey and Oligomer 2 was conjugated to the 

1 5 adjacent location, and so forth. An illustration of the compfetad array is 
shown in Figure 13. 

in carrying out the hybridization reaction, the three complementary 
oligonucleotides and the negative control oiigonucleotide were mixed at a 
final concentration of lO^IVl for each ongonucleotide in 1 mJ of TE buffer 

20 (10 mM Tris-HG, pH 8.0, 1 mM EDTA) supplemented with 1 M NaCI, 
and the solution was heated at 8S°C for 10 min. Immediately thereafter, 
the entire surface of the silicon wafer was flushed with 800 fj\ of the 
heated oligonucleotide solution. The compiemeMary ollgonudeotides 
were annealed to the immobilized oligomers by incubating the silicon 

25 array at ambient temperature for 1 hr, followed by incubation at 4"C for 
at least 10 min. Alternath/ely, the oligonucleotide solution can be added 
to the wafer which is then heated and allowed to cool for hybridization. 
An IHustfation of the complem ntary oligonucleotides annealed to the 
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specif ic oligomers covatdntly immobiCzed at each location is shown in 
Figure 14« 

The hybridized array was than washed with a solution of 50 mM 
ammonium citrate buffer for cation exchanoe to remove sodium arKi 
5 potassium ions on the DNA backbone iPieleSe U. et al., (1993) NucL 
Acids Res,21;3191>3196). A 6-nl aliquot of a matrix solution of 3- 
hydroxypicoHnic acid (0.7 M S-hydroxypioolinic acid-10 % ammonium 
citrate in 50 % acetonitrite; see Wu ej aL Rapid Commun. Mass 
SOfiStrorru 7:142-146 {T9S3}} was added to each location of the array 
to using a piezoelectric pipette as described herein. 

The solution was allowed to dry at ambient temperature and 
thereafter a 6-nl alrquot of water was added to each location using a 
piezoelectric pipette to resuspend the dried matrix-ONA complex, such 
that upon drying at ambient temperature the matrix-DNA complex fomis 
15 a uniform crystalline surface on the bottom surface of each location » 
MALDI-TOF MS analysis 

The IWALDI-TOF MS analysis was performed in series on each of 
the 16 locations of The hybridization array Hlustrated in Figure 14 
essentially as described in EXAMPLE 1. The resulting mass spectrum of 

20 oligonucleotides that specificatiy hybridized to each of the 16 locations of 
the DNA hybridization array is shown in Figure IS. The mass spectrum 
revealed a specific signal at each location representative of observed 
experimental mass-to-charge ratio corresponding to the specific 
complementary nucleotide sequence. 

25 For exampler in the locations ihst have only Oligomer 1 conjugated 

thereto, the mass spectrum revealed a predominate signal with an 
observed experimental mass-to-charge ratio of 7072.4 approximately 
equal to that of the 23-mer; the th oretical mass-to-charge ratio of th 
23-mer is 7072.6 Da. Similarly, specific hybridization of the 1 2-mer 

SUBSimiTE SHEET (RULE 2E) 



wo 99/312711 



FCTAIS9&'267ir 



-77- 

□IFgonudeotide to the array, obsarved exparlmental mass-to-charge ratio 
of 3618<33 Da (thdoretical 3622.4 DaK was detected only at those 
locations conjugated with OKgomer 2 whereas specific hybridization of 
MJM6 (obsarved experimental mafis-to-charge ratio of 6415.4} was 
5 detected only at those locations of the array conjugated with Oligomer 3 
(theoretical 6407.2 Oal. 

None of the locations of the array revealed a signal that 
corresponds to the negative controi 29-mer oligonucleotide (theoretical 
maas-to-charge ratio of 8974.8) indicating that specific target DNA 
10 molecules can be hybrrdized to oligomers covalentfy immobilized to 
specific locations on the surface of the ailicon array and a plurality of 
hybridization assays may be individually monitored using MALDi-TOF MS 
analysis. 

(b) 8x6 (64-locaifon) array 

A 2 X 2 cm^ silicon water having 256 mdividual depressions 
or wells that form a 16 X 1 6 an^ay of welis was purchased from a 
commercial supplier (Accelerator Technology Corp., CoDege Station, 
Texas ). The wells were 800 X 800 120 //m deep, on a 1.1 25 
pitch. The sjlicon water was reacted with 3-aminopropyltrlethoxysilar^e 

20 TO produce a uniform layer of primary amines on the surface and then 
exposed to the heterobifunctional crosslinker SIAB resulting In 
lodoacetamido functionalities on the surface < _6.q> , see Figure 7). 

Following the procedures described above for the preparation of 
the le-location DNA array, Oligomers 1-3 were immobilfzed to 64 

2B locations forming an 8 X 8 array on the 256 welt silicon wafer, 

hybridized to complementary oligonucleotides and analyzed by MALDI- 
TOF MS analysis. Figure 16 shows the mass spectrum of the 64- 
locatron DNA array analyzed in series by MAIXI-TOF analysis. As shown 
for the 16-iocation array, specific hybridization of the compi mentary 
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SP6 RNA Polymerase Transcriptian 

in vj^r^ transcription of the nicked DNA templates was performed 
in 20jul reactions of 40 mM Tris-HCI (pH 7,01, 6 mM MgCJj. 2 mM 
spermidine, lOmlVI NaCI, 10ml\i dithiothreitof, 1 unrt//ul RNasin 
5 <PromegB>, 5 mM rNTP, 5;jCi lff-32PlrCTP, 1 uniU/A SPQ RNA polymerase 
(Amersham, Arlington Heights, IL) at 37''C for 30 minutes. Abortive end 
full length RNA transcripts ware separated by gef electrophoresis and 
quantified by measuring the radioactivity of individual RNA fragments by 
drying the poiyacrylamrde gel and measuring the radioactivity as 

10 compared to a known standard using a Phospholmager (Motecular 
Dynamics, Inc.). The effioiency of fulHength RNA transcription of a 
nicked DNA template was calculated as a percentage the moles of full 
length RNA transcribed from a DNA template containing no nicks. The 
nick by pass efficiency of a nicked DNA template was calculated as a 

15 percentage of the moles fuU-length RNA transcript and the moles of RNA 
transcript stalled at the niok. 

As illustrated in Table 1 , the transcription of a full-length RNA 
proceeds with 88-94% efficiency when a nrck is introduced into the 
coding strand after nucleotides + 7, +8, + 9 or 4 19 reJative to the start 

20 of transcription. 



TABLE 1 





RNA (%» 


isiicfc-sUlied 


Full-Lcngth 
RNA (%) 


Niclc-Bypass 

E-fflciency 

(%> 


Ref. 


320±93 




(100) 


100±0 


1N + 7U 




12.45.4 




a8.5±5.1 


1NH8U 


187.4:1:38.9 


5.3-4.2 


87.1+4.3 


94.3±0.5 


1 1N^9U 


232.7±45.6 


10,1 ±1. 7 


65.7 ±4.4 


87.6±1.8 


1 1N+19U 


27a.e±33.8 


6.6 las 


64.8 ±\5 


90.9 40.^ 
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EXAMPLE 5 

DMA Sequencing using T7 RNA Polymerase 

5 1 . Design of Template and Primer Sequences 

All primers were synthesized on a commercially available DNA 
Synthesizer using conventional phosphoroamidite chemistry (Sinhe gt ai 
(1 984} Nucleic Acid Bgs. 12:4539) . jn Vitro RNA transcription was 
performed on a synthetic 276 nudeotide doubla-etranded DNA template 
10 (SEQ ID No. 13, which includes a T7 promoter). 

2. ONA Sequencing 

DNA sequencing of a target DNA template was carried out in 10 //I 
reactions of 40 mM Tris-Ha (pH 7.9), 6 mM MgClj, 2 mM neutralized 
spermidine. 5 mM dithiothreitol, 300 nM rCTP, 300 nM rATP, 300 nM 

15 rUTP, 300 nM rITP, BOO nlW rGMP. 10 f/C\ (a-'»P) rCTP or UTP. 10-300, 
generaUy TO-50 jM 3'-deoxynucteotide, 0,5-1 .0 pmole linearized or 
supercoiled DNA template. 4 units RNaain (Promega), 10 unit///l T7 RNA 
polymerase (USB) at 37"C for 30 minutes. Chain termination fragments 
RNA transcripts were separated by gel electrophoresis using 8% or 13% 

20 polyaorylamide gel. 

DNA sequencing ladders of RNA temiinated fragment were 
generated up to 180-200 bases. This example shows that T7 RNA 
polymerase can specif iealiy incorporate modified S'-deoxyribonuclecsides 
triphosphates as a base-specific chain termhator to generate nested HNA 

25 transcripts for sequencing nucleic acids, 

Sfrice nwdifications will be apparent to those of skill in this art, it 
is intended that this Invention be limited only by the scope of the 
appended claims. 
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WHAT IS CLAIMED: 

1 . A method for dBtermrning the sequence of a target nucleic 
acid molecule, comprising: 

a) providing a nucleic acid molecule comprising a promoter and 

5 target nucleic acid sequence operatively linked to the promoter, wherein 
Ihe nucleic acid molecuJe is Immobilized on a solid support; 

b) transcribing the promoter-containing nucleic acid molecule with 
an RISIA polymerase that recognizes the promoter under conditions 
whereby a nested set of RNA transcripts from the target is produced; 

10 and 

c) determining the molecular weight values of the transcripts by 
mass spectrometry and thereby determining the nucleic acid sequence of 
the target molecule, 

2. The method of claim 1, wherein the promoter-containing 
15 nucleic acid moiecuJe is {mmobiii2ed via the 5' end of the coding strand 

or the 3' end of the non-coding strand. 

3. The method of claim 1^ wherein the nucleic acid is DNA or PIMA. 

4. The method of any of claims 1-3, wherein the nucleic acid 
molecuJe comprising a promoter and target Is prepared by: 

20 hybridizing a nucleic acid promoter-containing probe having a 

coding and noncoding strand to a nucleic acid molecule comprising a 
target nucleic acid sequence, wherein hybridization occurs between e 
single stranded region of the promoter-containing probe and a 
complementary e'rngle stranded region at one end of the molecule 

25 comprising the target. 

5. The method of claim 4, wherein the molecule comprising the 
target Is single-stranded and the portion comprising the complementary 
singte-stranded region is at the 3' end of the target-containing molecule; 



SUBSTITUTE SHEET (RULE 28) 



-82- 

and the promoter containing probe molscuFe is double-stranded 
with a single-stranded region of at least 5 nticJeotides at the 3' end of 
the coding strand, wherein at least 5 contiguous nucleotides are 
complementary to the single-stranded portion of the target-containing 
5 molecuie. 

6. The method of cfaim 4, wherein: 

the molecule comprising the target is double-stranded except for a 
single-stranded region et one end that comprises at least 5 contiguous 
nucleotides compiemantarv to the single-etranded portion of the 
10 promoter-oontelning molecule; and 

the singfe-stranded portion of the promoter-containing molecule is 
at seme end, relative to the promoter, that the single-stranded region is 
on the target-containing molecule. 

7. The method of any of claims 4-6, wherein the promoter- 
1 5 containing molecule is immobUized on the solid support prior to 

hybridization of the target-containing molecule. 

8. The method of any of claims 4-6, wherein hybridization is 
performed prior to Immobilization of the promoter-containing molecule on 
the solid support. 

^ ^- method of any of claims 4-8, wherein nicks in the 

resulting molecules are ilgated prior to transcription by the RNA 
polymerase. 

10. The method of any of claims 1-9, wherein the nucleic acid 
molecule oompriaing a promoter and target nucleic acid is transcribed 

25 with an RNA polymerase to produce a plurality of base-specif icaily 
terminated RNA transcripts. 

11, The method of any of claims 1-9, wherein the nucleic acid 
molecule comprising a promoter and target nucleic acid Is transcribed 
with an RNA polymerase to produce a set of RNA transcripts that are 
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then speciflcallv cleaved wtth baSB-speclflc ribanucleases to produce a 
set of cleaved fragments whose molecule weight values are assessed by 
mass spectrometry. 

12. The method of any of claims 1-11 wherein the sequence of 
5 all or a portion of the target nucleic acid is determined by the RNA 

transcripts according to molecular wdght. 

13. The method of any of claims 1-12, wherein the nucfeic acid 
molecule comprising the promoter is produced by Immobilizing a ^ngle- 
siranded molecule that comprises a promoter or the complement of a 

TO promoter and hybridizing a fragment that comprises the complement 
thereof, whereby the resulting molecule comprises a double-stranded 
region contain'mg a promoter and a singie-stranded portion of at least 5 
nucleotides. 

14. The method of any of claims 1-13. wherefn the promoter is 
15 selected from the group consisting of phage, eukaryotic and prokaryotic 

promoters. 

IS- The method of claim T4, wherein the promoter is selected 
from erchaebacteria, eubacterte, bacteriophages, DNA viruses, RNA 
viruses, plants, plant viruses and animal promoters » 
2** 1 6- The method of any of claims 1 -1 5, wherein the RNA 

polymerase is a DNA-dependent RNA polymerase or an RNA-dependent 
RNA polymerase. 

17. The method of any of claims 1-16, wherein the RNA 
pofymerase is selected from the group consisting of phage, eukaryotic 

25 and prokaryotic polymerases. 

18. The method of claim 17, wherein the polymerases is 
selected from archaebacteria, eubacteria, bacteriophages, DNA viruses, 
RNA viruses, plants, plant viruses and animal polymerases. 
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1 9. The method of any of claims 1-18, wherein the BNA 
polymerase is selected from the group consisting of Escherichia coll RNA 
polymerase, T7 RNA polymerase, SP6 RNA polymerase, T3 RNA 
polymerase and replicaae. 
5 20. The method of any of claims 1-T9, wherein prior to 

immobilization of the nucleic acidj the surface of the support rs 
derivatized by reacting the surface with an aminosilane to produce 
primary amines on the surface of the support. 

21 . The method of claim 20, wherein the aminosilane is 3- 
10 aminopropyltriethQxysilane. 

22. The method of claim 21, further comprising reacting the 
primary amines on the surface of the support with a thio^reactive cross- 
Bnklng reagent to form a thiol-reactlve solid support. 

23. The method of claim 22, wherein the thiol-reactive cross- 
15 linking reagent is N-succinimldyl (Anodoacetyl) aminobenzoate (SIAB). 

24. The method of claim 22, wherein the immobilization of the 
nvcJeic aoid probe to a solid support cs effected by reacting the thioi- 
reactive solid support with a nudek: acid probe having a free 5-' or 3'- 
thlol group, whereby a covalem bond between the thiol group and the 

20 thtol-reactive solid support Is formed. 

25. The method of any of claims 1-21 < wherein the nucleic acid 
molecule is immoblBzed to the surface the solid support at a density of at 
least 20 fmol/mm^ by a covaient linkage. 

26» The method of any of claims 1-25, wherein the nucleic acids 
25 are immobiElzed on the surface of the solid support in the form of an 
array. 

27. The method of any of claims 1-26, wherein the solid 
support is silicon or ariicon coated. 
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28. The method of any of clsims 1 -27, wherein the surface 
comprUes a plurality of wells contprlsing the immobilized nucleic acid 
molecule* 

29. The method of cfaim 28, wherein the wells have a rough 
5 inierior surface. 

30. The method of claim 28, wherein the solid support has a 
rough surface. 

31. The method of claim 28, wherein the surface of the welfs rs 

pitted. 

10 32. The method of any of claims 1-31 

wherein the transcription reactions are performed in the presence of a 
rifaonucieoside triphosphate analog, whereby the resulting RNA 
transcripts have reduced secondary structure or the fidelity of temiination 
and turnover of the RNA polymerase enzyme is increased compared to 

15 RTsfA transcripts formed from ribonucleoslde triphosphatase. 

33. The method of claim 32, wherein the transcription reactions 
include one or more analogs selected from among Inosine 5 '-triphosphate 
(rTP), 4-thio uridine 5'triphosphate {UTP). S-bromo UTP and 5'-iodo CTP. 

34. The method of any of claims 1-33, wherein the nucleic acid 
20 is Immobilized to the solid support via a linker. 

35. The method of claim 34, wherein the linkage formed by the 
linker is reversible. 

36. The method of clarm 34, wherein the linkage formed by the 
linker is irreversible. 

25 37. The method of claim 34, wherein the linkage formed by the 

linker is photolabile, acid labile or chemically cleavable- 

38. The method of any of claims 1-37, wherein transcription is 
performed in the presence of one or more 3'-deoxyribonucleotides. 
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39. The method of any of claims 1-38. furth^- compriaing: 
adding a matrix material for mass spectrometry to the surface of 

the support before or after immobilization of the nucleic acid. 

40. The method of any of claims 1-39, wherein the RNA 
5 transcripts are conditioned prior to performing maas spectrometry. 

41 . The method of any of claims 1-4, wherein the hybridization 
of the nucleic aoid containing the target sequence to the promoter- 
containing molecule results in the formation of a nick in the coding strand 
corresponding lo positions beyond + 8 relative to the start of 

10 transcription from tfie promoter. 

42. The nwthod of daim 24, wherein the nick is at position 
+7, +8, -h9or +19. 

43. The method of claim 1 , comprising: 

a) immobilizing a nudeic acid promoter-containing probe on e 
16 solid auppon, wherein the nucleic acid promoter-oontaining probe 

comprises at least S nucleotides at the 3'-end of the coding strand that is 
compJententary to a single-stranded region at the 3'-end of the target 
nucleic acid; 

h) hybridizing the nucleic acid to be sequenced to the 
20 immobilized nucleic acid probe; 

c} transcribing the target nucleic aoid with an RNA polymerase 
to produce a plurality of base-specifically terminated RNA transcripts, 
wherein the RNA pdymerase recognizes the promoter; and 
d) determining the molecular weiglit value of each 
25 base-speciflcatly terminated RNA transcript by mass spectrometry. 

44. A method of Identifying transcriptional terminator sequences 
or attenuator sequences, oomprising: 

a) immoblDzing a nucleic acid promoter-containing probe on a 
solid support, wherein the nuci ic acid promoter-containing probe 
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comprises at least 5 nudeotides dt the 3' -end of the coding strand that is 
complennentary to a single stranded region at the 3'-end of the target 
nucleic acid; 

b) hybridizing the nucleic acid to be sequenced to the 
5 immobilized nudeic add probe; 

c) transcribing the target nucleic acid with an RNA polymerase 
to produce a sequence-terminated RNA transcript, wherein the RNA 
polymerase recognizes the promoter; and 

dj determining the molecular weight value of the RNA 
10 transcript by mass spectrometry, whereby the tenninator sequence or 
attenuator is identified. 
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k1 

696B Da 
170 RP 


k2 
6966 Dc 
100 RP 


k3 
6988 Do 
90 RP 


k4 
6977 Oo 
100 RP 


k5 

6971 Oo 
170 RP 


k6 
6969 Do 
no RP 


1<7 
6972 Do 
160 RP 


k6 
6978 Oo 
110 RP 


k9 
6952 Do 
250 RP 


klO 
6965 Do 
300 RP 


11 

6965 Dc 
130 RP 


(2 

6m Do 
140 RP 


13 

6962 Oo 
210 RP 


14 

6996 Da 
50 RP 


15 

6982 Oo 
160 RP 


16 

6968 Do 
1B0 RP 


17 

6904 Do 
130 RP 


16 

6968 Do 
200 RP 


19 

6996 Do 
dO RP 


no 

6968 Do 
100 RP 


mt 
6966 Oo 
190 


in2 
6979 Do 
120 RP 


m3 
6975 Oo 
120 RP 


m4 

696B Do 
190 RP 


mS 
6976 Do 
110 RP 


mS 
6986 Do 
120 RP 


m7 
6973 Oa 
160 RP 


mS 
6978 Do 
160 RP 


m9 
6975 Do 
230 RP 
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nl 

6961 Do 
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n2 
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1BD RP 


n3 
6970 Do 
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120 RP 


nS 
6953 Do 
210 RP 
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140 R? 


nB 
6962 Do 
160 RP 


n9 
6957 Do 
150 RP 


nlO 
6960 Oo 
160 RP 


01 

6965 Oo 

140 RP 


o2 
6960 Do 
230 RP 


o3 
6976 Do 
200 RP 


o4 
6953 Do 
250 RP 


oS 
6983 Do 
110 RP 


o6 
6967 Oo 
250 RP 


o7 
6970 Do 
150 RP 


o6 
6973 Do 
70 RP 


09 
6953 Do 
140 RP 


olO 
6952 Oo 
140 RP 


P' 

6976 Oo 
KO RP 


p2 
6961 Do 

90 RP 


P3 
6972 Do 

loLl Rr 


P4 
6S69 Oo 

on DO 
SO Rr 


p5 
6904 Oa 
1 in DD 


p6 
6968 Oo 

IDA DO 


p7 
6958 Do 
Kr 


P8 
69B1 Do 

iUv l\r 


P9 
6976 Do 


plO 
6965 Do 


6676 Do 
170 RP 


<J2 
69&5 Oo 
too RP 


q3 
6990 Do 
120 RP 


q4 
69B9 Oo 
90 RP 


qS 
69B4 Oo 
90 RP 


q6 
6969 Da 
170 RP 


6979 Oo 
70 RP 


6968 Do 
140 RP 


q9 
6973 Oo 
120 RP 


qlO 
6950 Do 
120 RP 


rl 

G966 Do 
130 RP 


r2 

6960 Oo 
1&D RP 


r3 

6969 Do 
100 RP 


r4 

6964 Do 
160 RP 


r5 

6966 Do 
130 RP 


r6 

6970 Oo 
110 RP 


r7 

6972 Do 
90 RP 


i8 

6939 Do 
130 RP 


r9 

6951 Da 
230 RP 


flO 

6965 Do 
200 RP 


si 

G963 Da 
130 RP 


52 

6953 Do 
210 RP 


s3 
6970 Do 
120 RP 


s4 
6971 Oo 
170 RP 


s5 
6957 Do 
130 RP 


s6 
6956 Oo 
160 RP 


s7 
6966 Do 
140 RP 


s8 
6975 Do 
120 RP 


s9 
6951 Do 
230 RP 


filO 
6969 Oo 
120 RP 


tl 

6974 Oo 
90 RP 


12 

6958 Da 
160 RP 


13 

6959 Do 
120 RP 


t4 

6952 Oo 
too RP 


t5 

6959 Do 
110 RP 


16 

6954 Do 
100 RP 


t7 

6950 Oo 
160 RP 


to 

6974 Do 
140 RP 


(9 

6967 Oo 
150 RP 


no 

6950 Do 
230 W 



LASER POWER « 41000 rOR ALL SPECTRA. 

EACH SPCCTRIAI THE SUU OP 10-30 SINGLE SHOTS. 



FIG. 12 



SUaSlTTUTE 8HST (RULE 26) 




SUBSTITUTE SHEET (RULE 26) 



\¥099/3l2?a 



FCTyiT$9!Lr267l8 



U/17 



\ 

<> 

< 


X 


\ 


\ 

x 




\ 

w 






'\ 




\ 

\%>, 


•\ 




'\ 




\ 





SUBSTmiTB SHEET (RULE 26) 



PCT/US91t,'2«7l8 



i5/l7 




SUBSTITUTE SffiET (RULE 28} 



PCT/US98/15719 




GUBSTmrTE 8HEET (RUL£ 26) 



\V099/3]27ft 



1 



PCT/USM8/267W 



LISTING 

GENBJiAT. raFORMATIOW 

(i) APPLICANT; 

(A) NAM?:: 5BC?UENGT^t, INC. 

(B) STRSET; 11555 Sorrento Valley Road 
iC) QZTii San Diftgo 

(i:) SAATE: C'al-ifornia 

{£) COUMTRY; USA 

(F) ?OSThh CODE (ZIP'. : 92121 

ii) APPT.TCAtrr: 

(A) NAXE: XCREA AD\'AKCSE) INSTITUTE 07 SCIslHCE /iND TECHTCOLOGV 

(B) S'lKliJl^r ; ^^73-- Kuaong-doag 

(C) r.JTY: aac-jcn 305 701 

(D) STATE; Yusong-gu 

(E) COUMTKV; Republic of xorea 
(P) PORTAL CODE (zip: i 32121 

IHVBKTOP. 

(A) KTANB: Chaiigvon Kan^ 

{B) STRB^TT: Expo Apt. ^OS-'lOl, IM-l Jeoimin-dong 

(C] ciTYi Taejon Sou -3 30 

(D) GTATi?! Yuaonfl <|ij 

<D; COUNTRY: SBPUELIC OP KOREA 
<E^ ?OS*TA^ COJB (Ziy* i 

(i) ISVENTOR 

(a:« HAMb;: Voung-Soo Kwon 

(R; .STREET: 25-2 ?^angwol-dong 

IC:- CITY: Kvangju 500-500 

fD) STATE I ^uk-gu 

{T» COllKTRYt Republic of Kor£d 

IB) POSTAL (ZIP) ; 



(1) lir/ENTOR; 

(A) NAME: Young Tae Kim 

(D) STRBF": fliirtgwon Apt. 2 204, 162 fJongpa - ?- - cicng 

(Cj CITY! Seoul 13a -172 

(D'l STATE: Sojigpa-ga 

(U) COUNTRY: REPURbTC 

(?:) P05;TAL code i2XF» : 



<i> INVENTOR: 

•:a) NAME: HuberL Kostex 

•IB) STP^ET: 93 S Via Malioxca Orive 

;c) CITY: La Jn>:ia 

S^ATR: Calif uoiia 
(D) COUNTRY: USA 
m POSTAL CODE (ZIP) : 920^1 

(i) TWVRWTOR: 

CA) NA»Ei Daniel P. Little 

(B) 8TRE67: 3^3 Olendale L^e Rd. 

(C) CITY? Patton 

(D) STATZi Pennsylvania 
(u> COUNTRY: USA 

(E) 2QSTA1 (ZIP) i 16668 
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(ij INVENTO:i: 

(Ai HAJ^E; Maryanne O'Donnell 

\J\) STREET : 

iC) CITY; 

(D) STATE; 
IP) nOIlNTRY; 

(E) POSTAL CODE (ZIP) : 

(A) CIAI'IB : Goui3lng Xiang 

rB) STREET: 113B1 Zapate Ave. 

(L'i City; Gar: ::iiegD' 

(Dj STATE ! California 

(D.I COIJNTRY: USA 

(i:) POSTAL CODG (ZiP) ; 92120 



<i) lir^^BNTORs 

•A I NAME: charlcfl cantftr 
{Bi STREET: 

ic\ city:; 

(B> ESTATE: California 
iD> COTJNTRY; USA 

>:e} posTAii ayoii :zip} : 

(i) INV5:>IT0R: 

{A) lUM-!;: David ijough 

IC> C:TV: 

:d) a'rAXli; 

[D) CCXraTRY! USA 

;e) postal code (ZT?) : 



(ill CITLE OF IKVEUTiCN: [4AS3 S:f J^CU'KOWrRIC MISTHODS i*X?K SliOUBNCINn 
NUCr.FTC ACIDS 

(iii) liTJMBES OF SE*OlJJdllCU!» : X'i 

(iv> COPsRESPONDENCE ADDR3SS: 

(A) ADDRESSEE: Heller 2hm^ Whice £i M<*Auli££e 
W ST'RBET: 4250 Tlxecutive Square, 7t'i Floor 
iC) CITY: Let J'S-la 
•D) STATS; CA 
(2) COUNTRY: U£A 
jP) ZIP: 92037 

<v> COXP-JTHR R3ADABLE FORM: 
♦:a> XBDIUM TYPE: Diskistre 
JR) CrOMP^ITBRj IBM Cuinpa-iblcs 
:C> OPERATING SYSTSMj DOS 
p) SOFTWARE £ ASCII 



(v>> CORRENT APPU-CAl'lON UATAt 
(A) APPLICATION SJUMBER: 
{B) FILING DATB: 
(::) CLASSIFIZATXOitJi 



ivil> PRIOR APPT.TCJVTION OATAr 
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(A) APPLICATION m)I4BER; 08/990.651 
(B? PILTN3 DATE: lS-DEC-19i?7 

(viil) ATTCR^Z/AGBWr IKFORMAIIOM: 
(ZS.i NAME; 3ei<jTaark, Sttphanie L 
{31 REQISrHATION NOI-IBEK ; 33,77f> 
(Cj REPEPKNCB/DOCKBT KUMBSRi 24735-20C7 



{ix) TBLECOMYITNICATIOK INFORMATION: 
{Ai TELEPE0MB: 619-450-d40G 
;b:- TELEFAX: €1S- 430-8439 

(2) THFCRl^ATION FOR SEO ID N0:1: 

(i) SBOUENCH GlIARACTBRISTICS; 
(A) LENGTH: 21 base pairs 
IB) TYPE: ruclG-c Acid 
(C } ^7RAMD2:;iJNBSr:> : B ingle 

Hi) mUSCULQ TYPE: cDNA 
< \ 1 i ) HTPnTHETlCALt KO 
<lv) AUTISEETSE; NO 
<v: FHAGHKKr TYPE: 
M) ORIGINAL SOURC£: 



(xi) SEQDENCB DESCRIPTION; 8EQ ID NO;:.: 
3AATTCGAOL' TOXiTACCCG Q 21 
(2) INFOR74ATIOH F03t SBO ZD irO:2: 

ii) flRQTTRKCE CHAPACrERlfiTlCS : 

(A) LENGTH: 21 ba&^ pairs 

[B) TYPE: nucleic acid 
(D STRANDilDKBSS t Single 
(D) TOPOLOGY: unknovin 

(Hi yiOT.TCUT.P. TYPE: CDBIA 
[iii) HYPOTHETICAL: NO 
J-v) A^ITISWSE: HO 
Iv) FRAGI'IBt'7 TYPE: 
ivi) ORIGIKAL SO'JRCBi 

(xi'l SEQDBSCS DESCRIPTXQK: SEQ ID NO: 2: 

C!CG6GTACt<3 AStTTCCSAATT C 21 

(2) INFORMATION FOR SZQ ID N0r3: 

(1} S^gJENC^: CHARACTERISTICS; 
(A) LENGTH: X2 baec pairs 
(3) TYPE: nucleic acid 
(Ci STRANDEDllBSS : Wikngw:i 
(J( TOPOLOQifi unknown 

(ii) MOLSOnE TYPE: cDHA 
(iix) HYPOTHETICAL: ITU 
iiv] ANTISENSE: NO 
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(v) FRAGUBKT TYPE: 

(vi) ORIGINAL SOURCRt 

(xi) SUQ'JEt^CE DESCRIPTION: SEC ID NO; 3; 

ATCGATCGAT CG 12 

(2) IN?ORMAT10H FOR SBQ ID NO:4: 

(-) SEOCBNCB OHAkACTBklS'i'XOSi;; 
i/s) LENGTH: 2E basc pdira 

•3J TY?53 nucleic acid 
{C} SrRANDEUNbSS^ : Single 
\ll> uclknown 

Ui: MOIiHCULE TYPSj cDNA 
<iii) HYPOTHETICAL: KD 
{iv) ANTlSBNeE: NO 
{v) FJ^ACJXENT TifPE: 
<vi) OKiOiNAL S0URC3: 

(xi) SEQUENCE 3BSCRIFTICN: SEQ ID N0:4: 
ATTOATflRAr CGATCCATCC ATC0ATC3 28 
•;2> IKFORPtATION PGR SEQ ID WO: 5: 

(1) SEQUENCE CHARATTBRZSTICS : 

(A) ItBGTGTHi 24 hSL&C: paixs 

(B) T1'F£: nucleic &cld 
iO) 5TRANDEDN3SS : Singles 
(U) lOPOLOOY: wkncwn 

lii) NOLECUIiS TYPE: CDNA 

(iii) ir/POTlIElICAli: NO 

(iv) ANTISENSEi NO 
(v; FRAGMENT TYPE i 
<vi) OUlGItlAL SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO; 5? 

CTGA7GCGTC GC;ATrATt:TT TTTT 74 

(2) INFORMATION FOR SEQ ID i:0s6; 

{i) EEQUENCS CHARACTERISTICS: 
iA) LBITOTH; 23 base pairs 
IB) TYPE I nucleic acid 
iC) oTRAKDKDNESS I unknown 
(D) TOPOLOGY: unkr.O'jm 

iii) MOhklCVLJ^ T^PB; cDNA 
(ill.) HY70TH3TICAL: NO 
iiyf) AI^IT ISEN SE s NO 

iv) FRAGMBCrr TYPE; 
(vi) ORrOlNAL SOURCE: 

•ixil S£QUEi;rCE DESCRIPTION; S^Q ID irOt6; 
C«CTTGGGA ACTGTGTAGT ATT 23 
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(2) INPORMATICN FOR SBQ 13 ICO:7: 

SEQUENCE CHARACTERISTICS: 

(A) LifiNGJH: 25 ba&e pairs 

(B) TYPE: nucleic acid 
<C) ST1^AI^3:DKES5 t d^ii^l^s 
(D) TOfOLOGV: unjCTio'A'n 

[±±) MCLBCOLE TYPE: vDNA 
(iii; HYPOTHKTICAL s NO 
(iv) AMTICEHSE: KC 
{V> FRACl^EaaT TYPE: 
(vi) ORIGINAL SOURUlSr 

(aci) SEQfUSNCE: MeC^lIPTIONi SEO ID NOiTt 

(iAUGATCCGA C3-CATCA'3AA TGT 23 

(2) TWFORMATION 3XDR SEQ 12 NO:B: 

(1) SEQUENCE CHARACTERISTICS; 
LEN<jTH: 12 bau& palri^ 
{B) TYFi::: nucleic acid 
iC) ETRAKDE3KESS s slcglc 
ID' XCP0L9<?V£ unknown 

(ii) KOLSCULE TiVEi dDlSh 
(ill} HYPOTEETia\Ls MO 
(iv) AKTISEHSE: NO 
(V) FRAGMBKT 7YPB : 
(vi) ORI<3Z»7>J^ S0URC3: 

(Xi> SEQURE3CE DS£CRIPTIOUr SEO 

AATACXALAC AG 12 

\2) INPORMftXIOM FOR SEQ ID NOsS: 

{■!) SEQITRNCE rHAHTUZTERISTICS t 
(AJ LE>IGTH: 29 base pairs 
•31 TTF^i nucleic acid 

(C) r.?RA>inRnNESS : UnkCfiwA 
TOFOIOCY: unxnown 

(ii) MOLECULE TYPK: cT>NA 
riii) HYPOTHBriCAl.: HO 

(iv) ANTISENSEi NO 

(v) FRAGMENT ?YFE t 
(v^) OPJOTTOT, ROfTIRCEi 

(XI ) SEQUENCE DESCRZPTIOF: SBQ IB 130:9: 

OATCT^.GCrG G<3CCGAGCrA GOCCaTTOA ?9 

\2) INFORMATION FOR SEQ ID N0:1G: 

{V; ^3?!QTTRKCK CHARACTERISTICS t 
<A) LENGTH: 5S base pairs 
(&> TVPE: nucleic acid 
<C> STRANDBDfclESS I single 
<D> TOPOLOGY I unknown 
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'ii; VOCSCULE TYPE: CDNA 
Ciii) riYPOniETTCAL: luO 
^\v) ANTISENfiE: NO 
•Ivj FRAGMENT TYPE: 
{vi) OiiiG-NAL SOURCE: 

Ixi) SEQUBNCB DEoCRIPXrOWt 52Q ID NQilO: 

ATAGACGC7G CTG^ACGGCA CCCTTCTCCk ACACTTC7AT ACTGTCACCT AAArC 55 

\2) INTORMATICN KDR StlO iJ N0:11: 

(i) SEOUEHCE CHT^CTBRISTICS : 
WW l.il5lG?3i 2£ base pairfi 
■lU; TVl-'U: nuc'ifiir acid 
(r.:- STRMnczDNEfiS ! Gingle 
[J>i TOPOLCGVs unknown 

(ii^ MOLBCCTLS TYPE s CDNA 
(iii? HYPOTHETICAL: NO 
(iv> AKTISENSB: ^X 
(v) FRAGMENT TYMj 
(VI) ORIGINAL SOURCE: 

<xi:< SEQUENCE CBSCSIPTICN: SEC ID NO: 11: 

CSATrTAGGrO ACACrATAGA AGTCT 25 

{2) INF0R?CATION FOR S30 ID NU:lZi 

(i; SEQUENCE CHARACTERISTICS: 
(Aj LENGTn; 30 base pairs 
(a> TYPE: n-jcle-ic Anid 
{C> STRAKDEDNESS : "Unloiown 
tD> TOFOIfOGY: wkaawn 

iil) TOLBCUr-S TYPES cDMA 
(iii) HY?OTHETICAI. : NO 
iiv) ANTISENSE: KO 
iv) PRRGXJOT' TY?3j 
• vi; ORIGINJ'iL SOURCB: 

ix3) SEQUENCB IJKSCRTPTIOK: . SRQ 13 NO: 12 1 

TGGA6AW3GG TGCC3CCCA3 CMZCQZCthT 30 

(2) IKFORMATTQN FOR fiEQ ID NO J 13 i 

(i) SEOUBNCE CHARACrEKiarlCS : 
iA; L3WGTH; 276 ba^ie pa-im 
(B; TVPE: nutrlfeio acid 
IC) STRASDSDKBSS ; single 
(D) TOPOLOGY: unknown 

(il> XOLECULE TYPES CCMA 

(iii) HYPOltOITlCAL: NO 

(iv) MmsENSSs no 

(v) FRAGMFNT TYPK i 
(Vi) ORKSIHAL SOURCKs 



<xi) SKgURtCCR 0Pi?CRIPT10H: SSQ ID 510 J 13; 
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GCTCTAATAC GACTCSVCrAT AEGGAGACAA 
ATCCCCGGGT AOXJAGCTCG AATTCCT'SGC 
GCCGTTGCTT Ct^CViCGrTC AAATCCCCiCT 
TCACCOGACA AACAACA3AT ARAAAAAAAG 
rrZGATGCCT GGCiaAXrCGT ATTCTATTCT 



GCTTGCATGC CrGGAQSTCG ACTCTAGAGQ SO 
AGTTTATSGC GGGCffiCi^C CACUCTCCGG 120 
CCCCCGCATT TGTCCTAC7C AfJaAtlAGCOT IBO 
CCCAOTCTIT C3ACTG30CC TTTCGTTTTA 240 
ATAGTG 2'iC 
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